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A Low-Noise Gallium Arsenide Field Effect 
Transistor Amplifier for 4 GHz Radio 


By R. H. KNERR and C. B. SWAN 


(Manuscript received June 14, 1977) 


A low-noise amplifier for 4 GHz radio has been designed and is in 
manufacture. The noise figure is <2 dB and the gain is typically 10 dB. 
Input and output return losses are 225 dB. The insertion loss with 
failure of either the power supply or the low-noise transistor is typically 
5 to 8 dB. The amplifier uses a single gallium arsenide field effect 
transistor in conjunction with a passive failsafe by-pass network uti- 
lizing circulators. This approach permits the noise figure and the gain 
flatness to be optimized for each amplifier without compromising the 
input and output matches. It is concluded that this single-transistor 
amplifier design has significant advantages both in performance and 
in simplicity over the balanced amplifier design. 


Il. INTRODUCTION 


Gallium arsenide Field Effect Transistors (GaAs FETs) are effecting 
a revolution in both the design philosophy and the performance capa- 
bility of new microwave systems. In addition, these devices can often 
provide an economical means for significantly upgrading the perfor- 
mance of existing systems. Such is the case with the 4 GHz radio system, 
where an RF preamplifier with a maximum noise figure of 2 dB is 
achieved with GaAs FETs. In this application, each common multi- 
channel amplifier permits the output power of typically five transmitters 
to be dropped 4 dB, from 5 watts to 2 watts, while still maintaining the 
system thermal noise objective for 1500 channels. This significantly 
increases the life of the transmitter amplifier triodes, thus improving 
the overall system reliability. 
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Fig. 1—Single-ended amplifier with provision for unpowered transmission. 


Il. GENERAL DESIGN CONSIDERATIONS 


The use of the GaAs FET amplifier as an RF preamplifier for FM sys- 
tems requires low intermodulation as well as a low noise figure. In ad- 
dition, since the amplifier is common to several channels (including the 
protection channel), reliability is of utmost importance. The two most 
serious failure mechanisms envisioned are: (i) transistor failure and (ii) 
power supply failure. With either type of failure, the GaAs FET amplifier 
inherently exhibits an unacceptable transmission loss (>20 dB) for radio 
applications. Use of a balanced amplifier with two transistors coupled 
with input and output 3 dB hybrid couplers would reduce the gain by 
only 6 dB for failure of a single transistor. But this redundancy and extra 
cost gives no relief for loss of the dc supply voltage for the transistors. 

Schemes, without active devices, for reducing the loss to <10 dB for 
either type of failure and which apply to the balanced as well as the 
single-ended amplifier are shown schematically in Fig. 1 and 2. The signal 
reflected from the unpowered FETs is fed to the output by intercon- 
necting the normally terminated arms of the coupler (Fig. 2) or isolator 
(Fig. 1). We have designed, constructed, and evaluated both balanced 
and single-ended amplifiers. . 

The requirements for this application are shown in Table I. The choice 
of the design approach to meet these requirements was based on a de- 
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Fig. 2—Balanced amplifier with provision for unpowered transmission. 
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Table | — Electrical requirements for 3.7 GHz to 4.2 GHz amplifier 


Max. Min. Units 
Input return loss — 25 dB 
Output return loss _ 25 dB 
Noise figure 2.0 — dB 
Gain 11.0 8.0 dB 
Gain flatness +0.5 = dB 
Intermodulation (2A-B intercept) — 23 dBm 
Unpowered insertion loss 10 — dB 
Table Ii — Single-ended versus balanced amplifier 
Single Balanced 
Gain Same (8-11 dB) 
Unpowered loss Same 
Output return loss 225 dB >20 dB 
Input return loss 225 dB 217dB 
Transistor failure 6-8 dB loss 2-5 dB gain 
Intermodulation (2A-B intercept) 3 dB advantage 
Noise figure 0.3 dB advantage 
Transistors required 1 2 
Couplers required 0 2 
Circulators required 3 1 





tailed comparison of the capabilities of the two amplifiers. Based on our 
laboratory experience, Table II compares the performances that we 
consider practical in manufacture. We realized that meeting the inter- 
modulation and failsafe requirements with a single transistor would allow 
significant cost savings. The single-ended GaAs FET amplifier reported 
here not only meets these requirements but also has match and noise 
figure advantages. This results from the low loss input circulator which 
allows us to independently optimize the input circuit match and the 
transistor source impedance for minimum noise. 


Ill. AMPLIFIER MODULE ; 

The GaAs FET is mounted in a microstrip circuit \(Fig. 3). This 
transmission line permits easy mounting of the transistor and MOS 
dc-blocking capacitors. The amplifier module per se has no adjustments. 
Tuning screws near the input and output of the module and in the cir- 
culator arms are used to adjust the amplifier for optimum noise figure 
and gain flatness. This feature compensates for variations in transistor 
parameters as well as for manufacturing tolerances of the piece parts. 


3.1 The GaAs FET output circuit design 


In a first order approximation the output circuit elements were de- 
termined using BAMP.* Supplying the S-parameters and the input re- 


* Basic Analysis and Mapping Program. 
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Fig. 3—4 GHz MIC amplifier (open). 


flection coefficient (T)4nF) that results in a minimum noise figure, circles 
of constant gain are drawn (Fig. 4). If the output circuit reflection coef- 
ficient equals yz, optimum gain is obtained. Any deviation from [mi 
results in a loss of gain corresponding to the values indicated on the 
circles of Fig. 4. Strictly speaking, the set of circles is only valid for one 
frequency (in our specific case 4 GHz), and a corresponding set would 
have to be drawn for each frequency under consideration. Since the 
S-parameter variation over the 12.5 percent frequency band of interest 
is smooth and relatively small, one set of circles suffices to demonstrate 
that the output impedance, shown in a dashed line, is reasonably close 
to match. The actual circuit which produced the impedance was trimmed 
empirically for bandwidth and flatness of gain. 


3.2 The input circuit 


The theory of noisy four poles has been treated extensively in the 
literature.!- It essentially says that the noise figure of the four pole 
depends solely on the impedance of the input circuit. The noisy four pole 
is completely characterized by the S-, Y-, or Z-parameters, the source 
reflection coefficient (Tnr) at which the noise figure is minimum 
(NF min), and the equivalent noise resistance (R,,). The measurement 
of R, is somewhat cumbersome and is described in Ref. 1. Once the pa- 
rameters are known, circles of constant noise figure*° can be drawn (Fig. 
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which in our case 


is about 15.5 dB. It is quite obvious that the points for optimum noise 
figure and optimum gain are significantly apart. A set of circles similar 
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Fig. 4—Circles of constant gain with input tuned for minimum noise figure. 
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, solid circles). This set of circles is very insensitive to frequency and 


independent of the load impedance. The spread of the circles increases 
The reflection coefficient, yg, in Fig. 5 represents the reflection coef- 


with increasing R,,. In our specific case R, = 14 2. [yy, the maximum 
gain load impedance, has been explained in the output circuit design. 


ficient of the source that would yield maximum gain 
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assuming that the 


load reflection coefficient is [yz . To keep Fig. 5 from becoming over- 
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Fig. 5—Circles of constant noise figure. 


crowded, only one circle is shown. It is seen that a gain of about 13 dB 
for optimum noise figure versus 15.5 dB for optimum match can be ob- 
tained. This figure, of course, is further reduced by broadbanding and 
gain flattening, as can be deduced from the source impedance trace in 
Fig. 5. The performance of the single-ended amplifier module is shown 
in Fig. 6. The gain of 11.6 dB and corresponding noise figure of 1.5 dB 
are in good agreement with the values that can be extrapolated from Figs. 
4 and 5. 
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Fig. 6—Performance of amplifier module. 


3.3 Final circuit 


For the amplifier to be manufacturable, some adjustability is required 
to compensate for variations in transistor parameters as well as me- 
chanical tolerances on all components. This adjustability is not readily 
provided in the microstrip circuit, but can be economically introduced 
in the air dielectric stripline circuit. Pairs of tuning screws are thus lo- 
cated in the air line just in front and just after the microstrip module (Fig. 
7). These permit tuning of the amplifier for optimum noise figure and 
gain flatness. 


iV. FAILSAFE BYPASS CIRCUIT 


When the GaAs FET is unpowered, both the gate and drain circuits 
appear approximately as open circuits. The transmission loss typically 
exceeds 20 dB. If the transistor fails, we expect a short circuit. In either 
case, the input and output return losses at 4 GHz are typically 2 to 4 
decibels. 

The provision of three circulators, as shown in Fig. 1, provides an ef- 
fective passive by-pass circuit. In the normal state, the relatively small 
reflected input signal is recombined with the amplifier signal at the 
output of the transistor. This appears as a small ripple on the gain 
characteristic which can be compensated by output tuning. In the un- 
powered or failed state, both the gate and drain circuits are “switched” 
to open or short circuits. The input signal, with relatively small loss, is 
then directed to the drain circuit of the GaAs FET where it is reflected 
to the output circulator and directed to the load. The total insertion loss 
is typically 5 to 8 dB. 
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Fig. 7—4 GHz MIC amplifier. 


The circulators for the bypass circuit and the waveguide-to-stripline 
transition® were developed in air dielectric stripline (Fig. 8). This simple 
technology assures minimum circuit losses, low cost parts and assembly, 
and very high yields. The intermediate circulator is terminated with 50 
ohms to provide >25 dB isolation. Since this isolation is only maintained 
over the 3.7—-4.2 GHz band, positive feedback can cause the amplifier 
to oscillate at lower frequencies. The “low-pass filter’ on the output 
substrate (Fig. 9) eliminated this oscillation which, for our particular 
by-pass loop, occurred at about 800 MHz. 


V. POWER REGULATOR AND ALARM CIRCUIT 


The dc operating point for the GaAs FET is a compromise between 
minimum noise and acceptable linearity. A regulator automatically sets 
the gate voltage so that Ip = 15 mA and Vpg = 4.8 volts. All GaAs FETs 
are thus powered identically and require no bias adjustment in manu- 
facture. The amplifier (Fig. 10) operates from a —24 volt supply at 60 
milliamperes. 
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Fig. 8—4 GHz stripline circulator. 


In case of transistor or power supply failure (Jp <5 mA or Ip > 25 
mA), a contact to ground is provided which energizes a remote alarm. 


VI. THE LOW-NOISE TRANSISTOR 


The GaAs FET was developed at the Murray Hill, New Jersey Labo- 
ratory.’ The gate length and width are 0.8 um and 2 X 250 um. The 
typical noise figure is about 1.2 to 1.4 dB at 4 GHz. 
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Fig. 9—Amplifier module without transistor. 
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Fig. 10—Amplifier module (schematic). 


Vil. PHYSICAL DESIGN 


The completed amplifier is shown in Figs. 3 and 7. The aluminum 
housing is die cast in two parts. The stripline center conductor is stamped 
in asingle piece from sheet brass. Interlocking molded plastic locating 
rings are used to locate both the circulator ferrites and the center con- 
ductor in the lower housing channel. The printed circuit board with 
power regulator and alarm circuits (Fig. 11) is mounted on the bottom 
side of the lower housing. 


VIN. AMPLIFIER PERFORMANCE AND TESTS 
8.1 Tests 


In order to meet the requirements in Table I, the amplifier was 
subjected to several tests, most of which used straightforward test pro- 
cedures. Special test sets were constructed for noise figure and inter- 
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Fig. 11—Power regulator and alarm circuit of 4 GHz MIC amplifier. 
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Fig. 12—Complete three-frequency NF test set. 





modulation measurements. The noise figure can be accurately and 
rapidly measured at three frequencies in the test set shown in Fig. 12. 
Intermodulation (IM) tests were done using a three-tone measurement 
set. Since not all stations are air-conditioned, a humidity test was 
started. The amplifier was placed in an 85°C (185° F)—-85% humidity 
environment for two months with DC bias applied. No change in per- 
formance was detected. Six field trial models were cycled over the tem- 
perature range of 4°C (40°F) to 60°C (140°F) with no significant change 
in performance. 


8.2 Performance 


The amplifier is in manufacture at Western Electric Company and 
meets the requirements summarized in Table I. Typical performance 
values obtained are: 


NF: 1.6-1.8 dB 

Input and output return loss: 28 dB 
2A-B intercept: 26 dBm 

Unpowered transmission loss: 5-8 dB 
Gain: 10 dB. 


We find that the amplifier tuning arrangement permits the present 
spread in transistor parameters to be accommodated easily. 
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IX. CONCLUSION 


We have demonstrated a simple 4 GHz microwave amplifier design 
which achieves a noise figure of 2 dB in manufacture. This has been 
achieved with a single low noise GaAs field effect transistor in con- 
junction with a passive failsafe by-pass circuit. It is concluded that the 
single-ended amplifier with input and output isolator has significant 
advantages both in performance and in simplicity over the balanced 
amplifier design for this application. The housing and major piece parts 
are die replicated so as to fit together with minimal assembly effort. 
Tuning screws are provided to accommodate variations in transistor 
characteristics and to allow relaxed piece part tolerances. 
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Theory of Analytic Modulation Systems 


By B. F. LOGAN, JR. 
(Manuscript received March 18, 1977) 


A general theory of analytic modulation systems is developed where 
the transmitted signal is of the form a(t) = Re {e'#cf(z(t))}. Here f(z) 
is an analytic function (modulation law), and z(t) = x(t) + ty(t) is the 
analytic baseband signal whose real part x(t) is a bounded bandlimited 
signal of spectral support [—Q,Q] which is assumed to have a bounded 
Hilbert transform y(t). It is shown for a large class of \z(t)}and modu- 
lation laws that z(t) may be recovered using a receiver incorporating 
the inverse function of f as a detector with appropriate pre- and post- 
detection filtering. The theory also shows that in the procedure for 
factoring certain positive bandlimited signals, an approximate Hilbert 
transform operator (bandlimited) may be used. A related result is that 
signals subjected to logarithmic companding (one-sided) and filtering 
may be recovered by a non-iterative method. 


Il. INTRODUCTION 


In 1962, Bedrosian proposed a modulation system called single side- 
band phase (or frequency) modulation.* (See Ref. 5.) The modulated 
signal is of the form 


Re fellwet + x(t) +iy(t)) — e~ y(t) cos (wet + x(t)) 


where x(t) is the “baseband” signal, y(t) is the Hilbert transform of x(t), 
and w, is the carrier frequency. The special relation between the am- 
plitude modulation e~”“) and the phase modulation x(t) results in the 
modulated signal having no spectrum in the interval (—w,,w-); 1.e., the 
amplitude modulation removes the lower sideband. However, the 
spectrum is still infinite in extent. We adopt the terminology “single 
sideband exponential modulation” (SSBEM) for this system. 
Bedrosian pointed out that SSBEM was compatible with conventional 
FM receivers, and suggested that the single-sideband system might offer 


* K. H. Powers received a U.S. Patent (No. 3, 054 ,073) on such a system shortly before the 
appearance of Bedrosian’s paper. See Voelcker.2! 
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some savings in transmission bandwidth over the conventional system. 
However, since filtering operations could radically alter the zero crossings 
of the modulated signal, it was not clear to what degree one could 
maintain compatibility and at the same time realize some saving in 
bandwidth. 

Others?!017 have compared the spectral distribution of single side- 
band and conventional frequency modulated signals for the cases of si- 
nusoidal and Gaussian noise modulation. They have shown that the 
“effective” bandwidths, as measured by central second moments, of the 
single-sideband signals may be greater or less than that of the conven- 
tional FM signal depending on the nature of the modulation. At any rate, 
it is not clear how one would translate these results into relative band- 
width requirements of the two systems, each employing a conventional 
FM receiver. 

Aside from the compatibility question, Barnard‘ has shown that the 
transmission bandwidth requirements of single-sideband exponential 
modulation are, in a strict sense, minimal. He showed that if the mod- 
ulation x(t) belonged to a certain subclass of bandlimited signals with 
spectral support [—2,Q], that the modulation could be recovered, within 
an additive constant, from a knowledge of the spectral distribution of 
the single-sideband signal in the interval [w.,we + 2 + ¢], provided « > 
0. This was proved by demonstrating the convergence of an iterative 
recovery scheme. 

Here we consider a class of single-sideband systems wherein the 
modulated signal is of the form 


s(t) = Re {f(z(t))et«t} 


where f(z), the “modulation law”, is an analytic function and z(t) = x(t) 
+ iy(t) is the “analytic signal” of which, say, the real part x(t) is the in- 
formation to be transmitted. We suppose that x(t) is bounded and 
bandlimited with spectral support [—Q,Q] and that s(t) is transmitted 
over a channel whose transmission function is the Fourier transform of 
an absolutely integrable function (impulse response) and is equal to unity 
over (w,,w- + a). It is shown under fairly weak conditions on f(z) and 
z(t) that x(t) may be recovered (by a relatively simple non-iterative 
method) from the received signal, provided a > . 

The gist of the method can be grasped by considering periodic signals; 


e.g., 


127 . 
x(t) = a > Xpetkt 
_ (Xo = 0, say) 
z(t) = > X,et*e, 
i 
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We suppose that sup |z(t)| = m, and f(z), the modulation law, is an- 
alytic for |z| < m,(f(0) = 0, say) 


f(z) = y wie’, |z| Sm. 
1 
Then setting w(t) = f{z(t)} we have 
w(t) = > Wyeikt 
I 


where W;, depends only on those X; and a; for which | <j < k. This sort 
of dependence allows us to determine x(t) from a bandlimited version 
of w(t), say, 


Wy (t) = - W,,e%*¢, 
1 


by what amounts to reversion of power series, provided /’(0) # 0. We 
have 


z= o(w) => bw", for |w| sufficiently small. 
1 


(6) = (a1)~? = tf(0)}7) 


Assuming that the series converges when w is replaced by w,(t) we 
set 


Zn(t) = pwn (t)} oe 2 b, {wr (t)}* 
and then by formal composition of the power series find that 


n 7 bs : 
Zit), pees ye epee 
1 n+1 


So the first n Fourier coefficients of z(t) and z,,(t) agree; i.e., under the 
stated assumptions, x(t) can be recovered by bandlimiting ¢{w,, (t)}, 
where ¢ is the inverse function, z = ¢(w), and w,, (¢) is the partial sum 
of the Fourier series of w(t). 

The simplicity of this procedure owes to the fact that z(t) has a one- 
sided spectrum and the analytic modulation law f(z) then gives a 
function w(t) = f{z(t)} which also has a one-sided spectrum. Since z(t) 
contains no negative-frequency components, the usual difference terms 
do not appear; i.e., the spectrum of w(t) in the frequency interval [0,c] 
depends only on the spectrum of z(t) in the same interval. 

The recovery procedure is not so transparent for more general band- 
limited signals x(¢). First of all, filtering w(t) with a filter whose trans- 
mission function is unity over [0,ca] and zero for frequencies greater than 
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6 (8 > a) will give a function w,,(t), analogous to w,,(t), which may 
differ considerably from w(t). It may be that ¢{w.,3(t)} does not have 
a one-sided spectrum; i.e., ¢{W.,g(7)}, 7 = t + iu, is not analytic in the 
upper half-plane u > 0. Indeed wy.g(t), (—© <t < ©), may not even be 
in the domain of definition of ¢; i.e., ¢ may have a natural boundary 
beyond which it cannot be extended. Even if ¢{w.,3(t)} does have a 
one-sided spectrum, the function ¢ need not have a power series repre- 
sentation over the range of wa,g(t) so that one cannot use convolution 
arguments to show that the Fourier transforms of ¢{w(t)} and ¢{wa,a(t)} 
agree over [0,a]. This particular problem is met by using generalizations 
of the Paley-Wiener theorem. 

The problem arising when ¢{w,, g(t)} does not have a one-sided spec- 
trum (w.,g(t) not in the range of the inverse function) is met by imposing 
restrictions on z(t), namely that for sufficiently large u, the range of z(t 
+ iu) is sufficiently small that w(t + iu) will be in the range of the inverse 
function. This implies that w.,,(t) may be filtered (with a Poisson filter) 
to obtain wy,3(t +b), which for sufficiently large 6 will be in the range 
of the inverse function. One can then obtain z(t + 7b) and then use in- 
verse Poisson filtering to recover z(t). 

Although one could conceivably recover z(t) from w»,g(t) by other 
procedures when ¢{w,,3(7)} is not analytic for u = b, the method here 
avoids any decision process and gives a simple receiver model incorpo- 
rating the inverse function and (possibly) a Poisson filter with its 
(bandlimited) inverse and appropriate low-pass filter. 

Generally speaking, given f(z) and the channel transmission function, 
one can design a receiver which will work for a certain subclass {z(t)} of 
signals. Or given f(z) and a fixed receiver design, one may ask for the 
minimum bandwidth channel required for transmitting a given subclass 
of signals. In this connection, some estimates are given for the bandwidth 
requirements of “compatible” single-sideband exponential modulation 
with {z(t)} all functions of spectral support [0,1] satisfying sup |z(t)| < 
m. In a recent work, Werner”? has considered the same problem for z(t) 
in L» and gives upperbounds in terms of the L2-norm of z. 

There are rather dramatic mathematical simplifications in the de- 
tection theory when the signals {x (t)} are restricted to be of the band-pass 

_type, allowing radical changes in the system design. 

The theory also shows that the factorization of certain positive band- 
limited signals can be effected with an approximate Hilbert transform 
operator acting on the logarithm of the signal. A related result pertains 
to the signal recovery problem considered by Landau and Miranker!!!2, 
viz., there is one companding function (“‘log’’) for which the signal can 
be recovered by a non-iterative method. 

Another interesting consequence of the general theory is the fact that 
for n arbitrary numbers az, k = 1,2,...,n, there exists an integer vy = 
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n and corresponding numbers az, k =n +1,...,v, such that the poly- 
nomial 


P,(z) = 1+ - apz® 
7 


is zero-free for |z| < 1. 

Although the general theory is interesting from a mathematical 
viewpoint, it would appear that the practical interest in analytic mod- 
ulation systems, other than the linear system, is limited to SSBEM, 1.e., 
to the case f(z) = e2 (or e!2). In this case one can trade bandwidth for 
simplicity of detection. However, the trade-off is attractive only for 
moderate amplitudes of z(t) where SSBEM offers an interesting alter- 
native to other systems employing envelope detection. 

It should be noted that the method here is naturally confined to 
bounded bandlimited signals z(t), since otherwise we would require both 
f(z) and its inverse ¢(w) to be entire functions, i.e., w = f(z) =a + bz. 
The exception would be the band-pass case where f could be an entire 
function and ¢ replaced by an equivalent polynomial (see Section 5). 

Of course, the theory here has to be extrapolated to practice with the 
appropriate “epsilons”; i.e., an analytic modulation law f(z) can only be 
approximated within ¢, over the disk |z| < m, and an analytic signal z(t) 
having one-sided spectrum can be realized in practice within ¢2, and the 
impulse response of an ideal filter can be approximated (in L,) within 
€3, etc. Then the continuity of the overall transformation may be used 
to bound the errors. 

In order to deal rigorously with “communication type” signals which 
do not have ordinary Fourier transforms a considerable amount of pre- 
liminary mathematics is required. However, one can follow the theory 
assuming that the signals are either periodic or have ordinary Fourier 
transforms, with one cautionary note in mind. Abrupt bandlimiting 
operations (spectral projections) such as convolution with sin t/t are 
not permissible (not defined) for the general signals of interest. 


Il. PRELIMINARIES 


A measurable function g(¢) is said to belong to Lp)(—~,~) abbreviated 
hereafter as Lp, (1 S p < ~) if 


f. lg(t)|Pdt < ~, 


The L,p-norm of g is defined by 


lello =| flee lear} ” 
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and if a is ascalar 


lagi = |e] - llglly. 


If |g(t)| is uniformly bounded, with the possible exclusion of a set of 
measure zero, g is said to belong to L.. and the norm of g is 


llgll. = essup |g(¢)| 
t 
where “‘essup” over ¢ is the essential supremum of |g(t)|, which is the 
infimum of numbers M such that 
|g(t)| < M for almost all ¢. 


We will be mainly concerned with continuous bounded functions g(¢) 
in which case 


lg. = sup |g(t)]. 
t 


For 1 < p < ~, the Lp norm satisfies the triangle inequality 
gi + gallp S llgillp + ligallp (1) 


which for any sequence of numbers a; satisfying D|a;| < © and se- 
quences of functions gz such that ||g, || <M, leads to 


| Sangre lly S Xan llgelly- (2) 


There are functions which belong to Lp for only one value of p. 
However, it is easy to see by considering the set where |g(t)| < 1 and the 
set where |g(t)| > 1, that if g belongs to L, and L, where 1 <r <s, then 
g belongs to Lp for every p satisfying r < p < s. For example, the func- 
tion sin t/t belongs to Lp for every p > 1. 

Associated with the space Lp is the conjugate or complementary space 
Ly where 


1 1 
=+==1, 
D @q 
For functions in complementary spaces we have Hélder’s inequality 
J e@h(oae| < Iellplinly, 1S p Se, (3) 








which for the case p = q = 2 is the familiar Schwarz’s inequality. 
In connection with Hélder’s inequality we note that the norm of a 
function may be equivalently defined as 


i. _ h(t)g(t)dt 





» hl =14¢=p/(p— 1). 





llgl_p = sup 
h 
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A convolution kernel K in L, carries Lp, into Lp. We have 


- ginLk 
K® n= f Ke=20 P 
g(t) __ ls) (t — s)ds Kin L, 


and 
IK @gllp < WKllllg|p (4) 


which amounts to a generalization of (2) to weighted sums of translates 
of g. The convolution integral is not in general defined for each t unless 
K belongs to the conjugate space of g. In general the convolution is de- 
fined as the limit in Lp of 


Bm(t)= {_ g(s)Km(t ~s)ds 


where K,, is asequence of bounded functions of L; (hence K,, belongs 
to L,) satisfying 


lim ( |K(t)—Kn(t)|dt = 0. 


m--o 


2.1 The Fourier transform on L, 


A function g in Lp has an ordinary Fourier transform, provided 1 < 
p S 2, (a theorem of M. Riesz, cf. Ref. 20) in the sense that 


T 
ér(w) = f. b(tvenfetde 


converges in norm as 7’ — ~ toa function g(w) belonging to the com- 
plementary space Ly, i.e., there exist a function g in L, such that 


lim |g — 8rllq = 0. 
To 


However the Fourier transform on Lp, 1 S p S 2, does not carry Lp into 
all of Lg except in the case p = 2. In particular, the Fourier transform 
of a function g of L; is a continuous function. Furthermore, (the Rie- 
mann-Lebesgue Lemma) 


lim g(t)e—“'dt = 0 
wrto J—@ 
for g in L;. Unfortunately, there is no simple description of functions 
-£(w) which are the Fourier transforms of functions g(t) of L;. A useful 
sufficient cordition is that (w) belong to Le and have a “derivative in 
L»” (meaning only that £(w) is the integral of a function of Lz denoted 
by dZ/dw). The sufficiency of this condition may be seen by writing 
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: = 1 
i. lg(t)|dt = f. (e+ ial -|(t + ia)g(t)|dt (a >0) 


and then applying Schwarz’s inequality and Parseval’s theorem. The 
result is (choosing the best value of a) 


{ f- letelae}” < lle: 


In obtaining this result we use the fact that 2 in Ly and dg/dw in Le imply 
that 2(w) tends to zero at +~. (By Schwarz’s inequality, gdZ/dw belongs 
to Ly, so {8 (w)}? is absolutely continuous and tends to limits at £~. the 
limits must be zero in order for £ to belong to Lo.) 


(5) 











Bs 


2.2 Bounded functions whose Fourier transforms vanish over certain sets 


It is not necessary to attempt to define the Fourier transform of a 
bounded function g(t) in order to give precise meaning to the statement 
that the Fourier transform of g(t) vanishes over some open set E. This 
can be done in a way which is consistent with the ordinary Fourier 
transform, should it exist, of g(t) vanishing over E. Here we restrict E 
to be the union of a finite number of disjoint open intervals. 


Definition: The Fourier transform of a bounded function g is said to 
vanish over E if and only if 


(i f__ eh(eat = 0 
for all hin L, whose Fourter transforms satisfy 
(ii) h(w) = h(t)e-tdt =0, w EE. 
This definition has its logical basis in Parseval’s formula for functions 


of Ly». The bar over h in (1) denotes the complex conjugate of h. It is 
readily verified that (1) may be replaced by 


(iii) | f ” g(t)h(—t)dt = 


which is more directly applicable to convolutions. That is, if the Fourier 
transform of g vanishes over E we have 


(g@h)(t)= f gls)h(t—s)ds =0 (6) 
for all h in L; whose Fourier transforms vanish outside E. 
We also note that in case the set E is symmetric with respect to the 


origin, h(t) in (i) may be replaced by h(t). [See Ex. 4 below.] 
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We say that the Fourier transforms of two bounded functions g; and 
&2 agree over E if and only if the Fourier transform of (g1 — g2) vanishes 
over E. We also say that g has spectral support E, (a closed set), or the 
spectrum of g is confined to E., meaning that the Fourier transform of 
g vanishes over the complement of E¢. 

The following are some elementary consequences of the definition. 
The proofs are left as simple exercises. It is understood throughout that 
&, £1, and go are bounded functions. 


Example 1. Suppose all the intervals composing E are finite and K(t) 
ts any function of L, whose Fourier transform K(w) satisfies 


K(w)=1 for wEeE. (7) 
Let 


got) =f gils)K(t ~ s)ds. (8) 


Then the Fourier transforms of g; and ge agree over E. 


Example 2. If the Fourier transform of g(t) vanishes over (a,8), then 
the Fourier transform of e*g(t) vanishes over (a + \,8 + X). 


Example 3. If the Fourier transform of g, vanishes over E,; and the 
Fourier transform of gg vanishes over Eo, then the Fourier transform 
of (g1 + 2) vanishes over EF; (\ Eo. 


Example 4. If the Fourier transform of g vanishes over E, then the 
Fourier transform of the complex conjugate % vanishes over E‘~), where 
E“) denotes the reflection of E with respect to the origin. 


Example 5. If the Fourier transform of g vanishes over E, then the 
Fourier transform of Re {g} (or Im \g}) vanishes over E (\ EO. 


Note: E (\ E™ may be the null set. However, if E is symmetric with 
respect to the origin, E = E‘—). Hence a class of functions whose Fourier 
transforms vanish over a set E which is symmetric with respect to the 
origin is essentially a class of real-valued functions, since the real and 
imaginary parts of the functions separately belong to the class. 


Example 6. (Reproducing Kernels) Suppose the spectrum of g is con- 
fined toa set E, consisting of n finite disjoint closed intervals. Let K(t) 
be any function of L, whose Fourier transform K(w) satisfies 


K(w)=1, we E. (9) 
Then for almost all t we have 
g(t) =f a(s)K(t ~s)ds. (10) 
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(Set g; = K ®g and show that (g; — g) is orthogonal to all of L.) 


Note: The qualification “almost all t” arises because the definition of 
a function g on a set of measure zero is irrelevant to the condition for its 
Fourier transform to vanish outside E,. However, (K ® g)(t) is a con- 
tinuous function of t and so we will adopt the convention that a function 
such as g in Ex. 6 is continuous. 

We note further that the condition that K in Ex. 6 belong to L, can 
be relaxed in case g belongs to L, for some p satisfying 1 < p < ~. In this 
case we can take K(w) = 0 forw €& E,. It is sufficient to prove this when 
E, is a single interval [—2,Q] and this has been done (Ref. 14). 


2.3 The Paley-Wiener Theorems for L~ 


There is an important connection between functions whose Fourier 
transforms vanish over a half-line and functions analytic and of expo- 
nential type in a half-plane. The following theorems are extensions to 
L. of the classical “one-sided” and ‘“‘two-sided”’ Paley-Wiener Theo- 
rems!8 for Lo. 


Theorem 1. The Fourier transform of a bounded function g vanishes 
over (—~,a) if and only if g(t) is the boundary value of a function g(r), 
7=t+1u, analytic in the upper half-plane u > 0 and satisfying 


sup |g(t + iu)| < e7*“ sup |g(t)| for u = 0. (11) 
t t 


There is the analogous theorem connecting functions g(t) whose Fourier 
transforms vanish over (8,~) and functions g(z7) analytic in the lower half 
plane. The specialization of Theorem 1 to functions whose Fourier 
transforms vanish over (—~,a) and (8,~) is the following. (We assume 
that —» <a@< 6 < ™ and according to the convention above qualify g 
to be continuous.) 


Theorem 2. The Fourier transform of a continuous bounded function 
g vanishes outside [a,8| if and only if g(t) is the restriction to the real 
line of an entire function g(r), 7 = t + lu, satisfying 


sup |g(t + iu)|< e~* sup |g(t)|, u 20 
t t (12) 
<e® sup |g(t)|, u <0. 
t 
These theorems are essential to the theory of single-sideband systems 
for bounded signals which do not have ordinary Fourier transforms. 
Actually we do not need a uniform bound on the rate of growth (decay) 


of g(t + iu) to infer that the Fourier transform of g(t) vanishes over 
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(—~,a). In fact, an asymptotic bound implies a uniform bound. 
Theorem 3. If g(r) ts analytic in the upper half-plane and satisfies 
sup |g(t +iu)|<e for u2=0 
L 


then the asymptotic estimate 
sup lg(t + iu)| = Ofe-"“} asu> @ 
implies 
sup lg(t + iu)| < e- sup |g(t)| for uO. 


Proofs of Theorems 1, 2, and 3 are given in Appendix A. 
We note the following corollaries of Theorems 1 and 2, concerning the 
Fourier transforms of products. 


Corollary 1. If the Fourier transform of g, vanishes over (—~,a) and 
the Fourier transform of g2 vanishes over (—~,8), then the Fourier 
transform of £182 vanishes over (—”,a + £). 

Corollary 2. If the Fourier transform of g vanishes outside [a1,81| and 
the Fourier transform of g2 vanishes outside [a2,B9|, then the Fourier 
transform of 212 vanishes outside [a1 + a2,81 + Bal. 


2.4 Terminology 


Functions whose Fourier transforms vanish outside a finite interval 
are called bandlimited functions. Generally, we think of the interval 
centered at the origin and refer to bandlimited functions also as low-pass 
functions. 

Functions whose Fourier transforms vanish over an interval centered 
at the origin are called high-pass functions, and functions which are both 
high-pass and low-pass are called band-pass functions. 

Functions (signals) whose Fourier transforms vanish over a half-line, 
usually (—~,0), are generally called analytic signals. 


2.5 The Hilbert transform and the analytic signal 


We would like to map the space of real-valued bounded signals x(t) 
of spectral support [—Q,Q] into the space of complex-valued bounded 
signals z(t) of spectral support [0,Q]. We would like the mapping to be 
linear and also have the property that translates of x map into translates 
of z, so that no “time stretching” is involved. The usual way of doing this 


is to take 
z(t) = x(t) + iy(t) (13) 


where y = X, the Hilbert transform of x. 
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The Hilbert transform is defined by 
1 foo) 
z(t) =— f oA ig (14) 
vil —_ 


ot = Ss 
where the cut in the integral sign indicates a Cauchy principal value at 
s = t. The difficulty we encounter is that an arbitrary bounded band- 
limited function does not have a Hilbert transform so we have to restrict 
x somehow. 
We may regard the Hilbert transform as the limit as a — 0 of convo- 
lution transforms 


Eq (t) = J #()Kalt ~ s)ds (15) 
with regular kernels K, given by 
1 t 
Kali) =— = 34 > 0, 16 
ee eT oo 


: Now K,(t) belongs to L, for every p > 1 and has a Fourier transform 
Ka(w) given by 


Ka(w) = —i(sgn w)e~@lel, (17) 


Thus if x(t) has a Fourier transform £(w), the Hilbert transform £(t) has 
a Fourier transform given by 


f. ” #(t)e—tdt = —i(sgn w)£(w) (18) 


and consequently the Fourier transform of z(t) as defined in (13) van- 
ishes over (—~,0). Now z(t) is the boundary value of the function z(7), 
7 =t +1u, defined by 


er) =~ f° AN Ge “iii. (19) 
HJS-oT—S 

In case x does not have an ordinary Fourier transform, but has a bounded 
Hilbert transform £, the function z(7) is bounded and analytic in the 
upper half-plane and according to our definition and Theorem 1, the 
Fourier transform of z(t) vanishes over (—@,0). Also if the Fourier 
transform of x(t) vanishes outside [—,Q] then the Fourier transform 
of z(t) vanishes outside [0,Q]. 

The subclass of bounded functions which have bounded Hilbert 
transforms does not have a simple alternate description. In order for x(t) 
to have a bounded Hilbert transform it is sufficient that x(t) have a 
bounded derivative and a bounded integral (Ref. 15). If the Fourier 
transform of x vanishes outside [—,Q] then x(t) has a bounded deriv- 
ative (‘““Bernstein’s Theorem,” cf. Theorem 11.12, Ref. 6) satisfying 
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sup |x’(t)| < Q sup |x(t)|, (20) 
t t 


and there are various ways to restict x(t) to have a bounded integral. For 
example, we may assume that g(t) is an arbitrary bandlimited signal and 
set 


x(t) = g’(t). (21) 


(Also any high-pass function has a bounded integral (Ref. 14).) Then we 
can determine g(t) within an additive constant from the real part of z(t). 
Assuming that the Fourier transform of g vanishes outside [—2,Q] we 
have the inequality (Theorem 11.4.3, Ref. 6) implied by (21), 


sup |<(¢)| < @sup |g(t)]. (22) 
t t 


An interesting subclass of bandlimited functions which have bounded 
Hilbert transforms are the band-pass functions. For these functions, 
there are equivalent Hilbert transform kernels which belong to L. If the 
Fourier transform of x(t) vanishes outside the intervals [-Q,—rQ] and 
[rQ,Q] where 0 <r <1, then x has a bounded Hilbert transform satisfying 
(Ref. 7) 


sup |<(¢)| < [A += log sup |x(t)| (23) 
t us r t 
where A < 4/z and 2/z cannot be replaced by a smaller number (i.e., as 
r approaches zero). 
Other subclasses worthy of mentioning may be generated by convo- 
lution transforms on L... Thus if |g| < M and k isa kernel in L; which 


has a Hilbert transform also in L, then the class of functions of the 
form 


x(t) = f. g(s)k(t — s)ds (24) 
have bounded Hilbert transforms given by 
z(t) = Mi g(s)k(t — s)ds. (25) 


For example, x(t) may be the output of some crude sort of band-pass 
filter (like an a-c amplifier) as would be the case for 


k(t) =ae~2! — bet, tz0 
= 0, t<0 (26) 


where a > 6b > 0. Then 


sige 2 2 (a — b)iw 
qi eer a atiw btiw (atiw)(b+iw) a) 
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and 
sy ee eee (a — b)|a| 
is EMO ONG eee Bay. (28) 


It follows from (28) and (5) that k belongs to Ly. 

Another sufficient condition for a bandlimited function x(t) to have 
a bounded Hilbert transform is the requirement that x belong to Lp 
where 1 < p < ~. Then if the Fourier transform of x vanishes outside 
[—,Q] we have (Ref. 14) 


x(t) = f. fg) es (29) 
— a(t — s) 
Then x has a Hilbert transform given by 
© 1 a t es 
£(t) = f 4g =e) (30) 
—o a(t —s) 


and thus by Hélder’s inequality 


a ° 11 — cos Qt 
21 < lel { fF [SS 


q I/q 
ac! 
at 





where A + z =]. (381) 
Dp @q 


However, the condition that x belong to Lp (1 S p < ~) is not a sat- 
isfactory condition for communication signals. 

Hereafter we will suppose that x(t) is so restricted that z(t) = x(t) + 
iy(t) is a bounded function whose Fourier transform vanishes outside 
[0,Q]. We should note that this assumption does not imply that y is the 
Hilbert transform of x (even within an additive constant). That is, the 
assumption does not imply that the integral 


ia x(t) — x(0) di 


T t 


tends to a limit as T — ~, so in effect we are allowing some functions that 
are not of the form z(t) = x(t) + 1€(t). In case we further restrict z(t) 
to be a bounded function whose Fourier transform vanishes outside 
[rQ,Q] where 0 <r <1, then we can assert that z(t) is of the form x(t) + 
ix(t) where the Fourier transforms of x and & vanish outside [—Q,—rQ] 
and [rQ,Q]. Conversely if x is a real-valued (band-pass) function whose 
Fourier transform vanishes outside these two intervals, then x(t) has 
a bounded Hilbert transform <(t) and {x(t) + 1%(t)} is a bounded function 
whose Fourier transform vanishes outside [rQ,Q]. In other words, we can 
always regard any real-valued band-pass signal x(t) as the real part of 
an analytic signal z(t). This is a special case of a representation theorem 
for high-pass signals (Ref. 13). 
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Ill. MODULATION AND EQUIVALENT BASEBAND TRANSMISSION 


We suppose that z(t) is a bounded bandlimited signal with spectrum 
confined to [0,0]. Now let f(z) be a function which is analytic over a re- 
gion which includes the disk |z| < m, where m = sup |z(t)|. We take f(z) 
as a modulation law and generate 


w(t) = fiz(t)} (32) 


which is the boundary value of a function w(r) bounded and analytic in 
the uhp. Hence the Fourier transform of w(t) vanishes over (—,0). 
Generally, the modulation process also includes translation of the 
spectrum of w(t) leading to a transmitter output 


a(t) = Re {w(t)e'e4}. (33) 


where w, > 0 is the carrier frequency. The Fourier transform of o(t) then 
vanishes over (—w,,w,) and hence a(t) is called a single-sideband signal, 
although the upper side-band may be infinite in extent. 

In conventional single-sideband amplitude modulation (SSBAM) the 
modulation law is the linear law, f(z) = z, in which case the Fourier 
transform of o(t) vanishes outside the intervals [w.,we + Q] and [—w,. — 
Q, —w,], so that the bandwidth required for transmitting o(t) is (counting 
positive and negative frequencies) 2 + ¢ where «¢ is an arbitrarily small 
positive number. In other words, o(t) has a reproducing kernel in L, of 
bandwidth slightly larger than 20. (Recall that the Fourier transform 
of a function of L1 is continuous.) We will see that the spectral economy 
of SSBAM carries over to more general modulation laws f(z). So we as- 
sume that o(t) is transmitted over a channel (characterized by an Ly 
impulse response) which has unity transmission over the frequency 
bands [w,.,we + a] and [—w, — a,—w,.| where a > Q. The transmission 
may be zero outside slightly larger intervals. 

We denote the received signal by og(t) and since its Fourier transform 
vanishes over (—w,w-) it has a Hilbert transform Gr (t). We assume that 
the carrier frequency and phase are known at the receiver so that we can 
form 


Walt) = e'elor(t) + 1éR(t)}. (34) 


In engineering parlance the real part of w(t) is obtained by in-phase 
synchronous demodulation of or (t), while the imaginary part of w,(t) 
is obtained by quadrature synchronous demodulation of or(t). The 
Fourier transform of w,,(¢) vanishes over (—~,0) and we have 


walt) = f. w(s)Ka(t — s)ds (35) 
where K,,(t) is the impulse of an equivalent baseband channel satisfying 
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for some a > Q 


f.. K,(t)e-i#tdt =1, for0<w<a (36) 


f. |K,(t)|dt < ~, (37) 


Hereafter we will be concerned with recovering z(t), and hence x(t), from 
W(t) in the equivalent baseband transmission of w(t) as given by 
(35). 


IV. THE INVERSE FUNCTION AS A DETECTOR 


We would like to solve (35) for z(t) where w(t) = f{z(t)}. This is (su- 
perficially) similar to the problem of Landau and Miranker!!? where 
w(t) = f{x(t)} and f is a real function of a real variable, x(t) is a real- 
valued bandlimited function of L2 whose Fourier transform vanishes 
outside [—0,Q], and K,(t) = (sin Q¢)/rt. In order for x(t) to be recovered 
(by an iterative process) they require that f have an inverse over the 
range of x and that is essentially what we require. The inverse of an an- 
alytic f is more complicated, but the fact that the Fourier transforms of 
z(t) and f{z(t)} vanish over (—~,0) simplifies the recovery problem. 

Let us write 


w = f(z) (38) 
and 
z= g(w) (39) 


for the inverse and think first of the problem of recovering z(t) from w(t). 
In case f maps |z| < m one-one onto some region D*, there is no problem 
since vis single valued over D*. In general vis not single valued and we 
have to know something about z(7) in order to decide what element of 
y is “the” inverse. For example, suppose f(z) = 2z + z2. Then given 


w(t) = 2ae™ + a2ei 


we do not know whether z(t) = ae“ or z(t) = —2 — ae! without some 
additional knowledge, such as for example, lim,—. 2(¢ + iw) = 0, or |z(t)| 
< 1. In this example the inverse function, 


= ow) = -1+ (1 +w)¥? 


is not a single-valued function of the complex variable w and one gen- 
erally speaks of two branches of the inverse function. The branches have 
singularities at w = —1, the image of z = —1 where f’(z) = 0. Clearly in 
this case, if we require |z(t)| < 1 then we know 


2(t) = -1+vV1+4+ w(t) 
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where V1 = 1. Then g{w(t + iw)} is bounded for u = 0 and analytic for 
u > 0. If we relax the requirement to |z(t + iw)| <1 for u > 6 then qw(t 
+ iu)} will be analytic for u > b, but not necessarily for u > 0. 

In general, we require that z(r) and f(z) are so constrained that 
y{w(r)}is analytic for u = uo. 


4.1 Received signal in the range of the inverse function 


Now we are not given w(t) but instead we have a filtered version w,(t). 
Suppose ¢{w(7)} is analytic in the upper half-plane u = 0 and w,(t) is 
sufficiently close to w(t) that ¢{w,,(7)} is also analytic in the uhp u 2 0. 
We say then that w,(t) is in the range of the inverse function. A simple 
sufficient condition for this is that ¢ be an entire function. Also the 
channel could have sufficiently large bandwidth for w,(t) to be close 
enough to w(t). 

We assume then that 


w,(t t+iu) EC D*, u=zo (40) 

where 
¢g(w) is analytic for w © D* (41) 
|y’(w)| < M for w € D* (42) 


Then we may take the inverse of w,(t) to obtain 
Zq(t) = gofWa(t)}. (43) 


Now we will see that the Fourier transforms of z,(t) and z(t) agree over 
(—~,a). 

First, it follows from (35)—(37) and Ex. 1 of Sec. 2.2 that the Fourier 
transforms of w(t) and w,(t) agree over (—~,q); i.e., the Fourier trans- 
form of {w(t) — w,(t)} vanishes over (—”,a). Then from Theorem 1 we 
have 


|w(t + iw) — wa(t +iu)| < e~% sup |w(t) — wa(t)|. (44) 
t 


From (42) we have 
|y(w) — g(w,)| < M|w — w,| for w € D* 


Wa € D* Cd 


Thus {z(t) — z,(t)} is the boundary value of a function bounded and 
analytic in the uhp satisfying 


[z(t + iu) — z,(t + iw)| < Me~* sup |w(t) — w,(é)|. (46) 
t 
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Hence from Theorem 1, 
z(t) — z_(t) = h,(t) (47) 


where the Fourier transform of h,(t) vanishes over (—~,a). Since the 
Fourier transform of z(t) vanishes outside [0,Q] and a > Q, (47) implies 
that we can bandlimit z,,(¢) with an appropriate low-pass filter to obtain 
z(t). Thus if Ko,.(t) is any kernel of L; satisfying 


f. Koa(t)e-tdt =1, 0<w<2 
0, w2a (48) 


we have from Ex. 6, Sec. 2.2, with the convention that z(t) is continu- 
ous, 


z(t) = f. 2(s)Koa(t —s)ds (49) 


and since the Fourier transform of h, vanishes over (—~©,a) we have (Ko,« 
@h,)(t) = 0; i-e., 


2(t) = f . bela Konlt= sds: (50) 


4.2 Pre-detection filtering 


In case the received signal w,(t) is not in the range of the inverse 
function, we may under suitable conditions recover z(t) by appropriate 
filtering before (and after) detection. Here then we replace (40) with the 
condition 


w(t tiu) €& D* foru = uo (= 0). (51) 


It follows from (44) and (42) that for sufficiently large b we have 
Wa(t tiu) © D, foru=b (52) 
where Dj is slightly larger than D* and 
y(w) is analytic for w € Dj; (53) 
| o’(w)| < My for w € Dj. (54) 


Then we have w,(t + 1b) in the range of the inverse function. 
Now the Poisson kernel with ha u 
P,(t) =—-—— >0 (55) 
Se. 
ae a 


reproduces functions bounded and analytic in the uhp from their 
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boundary values. (See the proof of Theorem 1 in Appendix A.) We 
have 


peii= f. ile) PA elds: (56) 


That is, we may determine w,(r7) along a line u = 6 parallel to the real 
axis by convolving w,(t) with the Poisson kernel (parameter u = b). This 
operation we term Poisson filtering. Since, by assumption, w,(t + ib) 
is in the range of the inverse we may take the inverse of w,(t + 1b) to 
obtain 


Za(t +ib) = glw,(t + 1b)} (57) 
which is analytic in the uhp and then as argued before 
j2(t +iu + ib) —z,(t + iu +ib)| < Moe. (58) 


So the Fourier transforms of z(t + ib) and z,(t + 1b) agree over 
(—2,c). 

Thus if the conditions (51), (41), and (42) are met we may by suitable 
pre-detection filtering (Poisson filtering) obtain a function z,(t + 1b) 
which corresponds to replacing z(t) at the transmitter by z(t + ib); i.e., 
from the reproducing property of the Poisson kernel 


w(t +i) = : HOPLG= Has 


= flz(t + iu)} = f “ flels)Pult — s)ds. (59) 


Then interchanging the order of convolutions with P,(t) and K,(t) in 
(35) we have 


wa(t + ib) = f. flz(s + ib)}]Ka(t — s)ds. (60) 
This relation has been noted by Foschini.® 
The Poisson kernel is a contraction operator; 1.e., it averages the values 
of a function so that the range of the resultant is no larger than the range 
of the function. Also as a filter it has the frequency response 
Ws P, (t)e~i#tdt = e~“lel, (w> 0). (61) 
We have 
z(t + ib) = ne z(s)Pp(t — s)ds (62) 
and therefore for large b we would expect the range of z(t + 1b) to be 


ANALYTIC MODULATION SYSTEMS 509 


appreciably less than the range of z(t) for a wide class of z(t) since for 
very large b it is only the low-frequency content of z(t) that contributes 
appreciably to z(t + 1b). It is possible, though, for the low-frequency 
content to be such that, for every u = 0 


z(t +tiu)~cos VitisinVt ast. 
If we require z(t + iu) to tend uniformly to a limit zp as u > &3 ice., 
z(t +iu) —zo| <e(u), -—~ <t<o, (63) 
where ¢(u) — 0 asu — © and in addition 
f’(zo) # 0, (64) 
then ¢(w) will be analytic in the neighborhood of wo = f(z) and so (51) 
will be satisfied for sufficiently large up with (41) and (42) holding. 
The condition (63) is not a severe constraint. In fact, all the simple 
sufficient conditions, given in (21)-(29), for x(t) to have a Hilbert 
transform imply (63) with zo = 0. However, x(t) may have a Hilbert 
transform without (63) holding. 
In connection with pre-detection filtering, we note that equivalent 
Poisson filtering can be effected at the carrier frequency (or an inter- 
mediate frequency) in the receiver before the synchronous demodulation 


of the received signal op (t) indicated in (34). That is, if the signal op (t) 
is passed through a filter whose frequency response Fy (w) satisfies 


Fy(w) = eee), ww > we (65) 
Fy (—w) = Fy (w) 
a signal or (t;b) is obtained which we may identify as 
op(t;b) = Re {e'#tw,(t + ib)}. (66) 
Then synchronous in-phase and quadrature detection of op(t;b) yields 
the real and imaginary parts of w,(t + 1b). Thus the Poisson filtering 
may be accomplished with a single equivalent frequency-translated filter, 


whereas the direct Poisson filtering of the complex signal w.(t) requires 
two Poisson filters acting separately on the real and imaginary parts. 


4.3 Post-detection filtering 


When Poisson filtering is required to bring the received analytic signal 
within the range of the inverse function the low pass filtering after de- 
tection must be modified. The output of the detector is z,(t + ib) and 
we need z(t). Now the Fourier transforms of z(t + 1b) and elt + ib) 
agree over (—~,a) and 


ye abyS f. 2(s)P,(t — s)ds. (67) 
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The Poisson filtering operation has an inverse for bandlimited func- 
tions; L.e., 


ss f. z(s + ib)Q,(t — s)ds (68) 
where Q,(¢) is any function of L; satisfying 
ff. ” Q,(t)e-i#tdt = eo, 0<w <Q. (69) 


Since the Fourier transforms of z(t + ib) and z,(t + 1b) agree over 
(—~,qa), the Fourier transform of 


zt)— f z,(¢ + ib)Q,(t —s)ds (70) 


vanishes over (—~,a), and since the Fourier transform of z(t) vanishes 
outside [0,Q], we have [cf. (47)-(50)] 


2(t) = f. z_(s + ib)k(t — s)ds (71) 
where k(t) = k(t;b,Q,a) is any kernel in L satisfying 
f__bWenivtde = er, OS ws 0 (72) 
= 0, wa 


That is, the post-detection filtering must invert the pre-detection fil- 
tering over the band [0,Q] and remove frequencies greater than a. (Of 
course, in a practical system we are interested in recovering only x(t) so 
that only one post-detection filter is required, acting on the real part of 
Z(t + 1b).) 


V. SPECIALIZATION TO BAND-PASS SIGNALS 


In case the base-band signal x(t) is of the band-pass type, i.e., a signal 
whose Fourier transform vanishes outside [rQ,Q] and [—2,— rQ] where 
0<r <1, the detection theory may be modified so that no Poisson fil- 
tering is required. In this case the inverse function may be replaced by 
an entire function, in particular, a polynomial. All we require for the 
recovery of band-pass signals is that f{z(7)} be analytic in the uhp and 


f’(0) # 0. (73) 
Then 


w = f(z) = x a,z" for |z| sufficiently small (74) 
k=0 
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z= g(w) = x bp(w — ao)" for |w — ao] sufficiently small. (75) 
k=1 


Here b; = (a;)"! = [f’(0)]1. 

Now for band-pass signals x(t), the Fourier transform of the analytic 
signal z(t) vanishes over (—~,rQ) where 2 > 0 and 0 <r < 1. Thus by the 
Paley-Wiener Theorem for La, 


|z(t +iu)| < e-"® sup |z(t)|, u =O. (76) 
t 
Then we have 


w(t +iu) —ag= x a,{z(t tiu)}* for sufficiently largeu. (77) 
k=1 


It follows that the Fourier transform of {w(t) — ao} also vanishes over 
(—,rQ). Hence the Fourier transform of {w,(t) — ao} vanishes over 
(—~,rQ). 

Now let ¢*(w) be any entire function of the form 


g*(w) = Dy Ch(w — ao)* (78) 
where 
Ce = by, fork =1,2,...,n (79) 
and n is an integer such that 
nr > 1. (80) 
Defining 
Za(t) = g*{wa(t)} (81) 


we have z,,(7) analytic in the uhp and for sufficiently large u 
z(t t+iu) —z,(t + iu) 
= glw(t + iu)} — ¢*{wa(t + iu)} 


= ¥ balfw(t + iv) — aol — {walt + iw) — ao}*] 
k=1 
+ x bp{w(t + iu) _ ao} 
k=nt+1l 


— > CR{Wa(t + iu) — ao}*. (82) 
k=n+1 


Since the Fourier transforms of {w(t) — ao} and {w,(t) — ao} vanish over 
(—,rQ), the last two sums in (82) are of the order of exp {—(n + 1)rQu}. 
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Also, for k = 1 
|{w(t + iw) — ao} — {wa (t + iu) — ao}*| 
= O{|w(t + iu) — w(t + iu)|} (83) 
= O(e-%), 


So it follows from the Paley-Wiener Theorem (with Theorem 3) that 
the Fourier transforms of z(t) and z.,(t) agree over (—~,B) where 


B = min {a,(n + 1)rQ}) > Q. (84) 
Therefore 
z(t) = f. z,(s)Kop(t — s)ds (85) 
where Kop is any function of L, satisfying 
iv Koplt)e“dt=1, OSw< (86) 
= 0, w = B, 


Thus for band-pass signals we have the option of replacing the inverse 
function ¢(w) by an equivalent entire function ¢*(w) so that it does not 
matter whether or not the received analytic signal w,(t) is in the range 
of the inverse function ¢(w). In particular, ¢*(w) may be a polynomial 
of degree n where n is roughly the ratio of the upper and lower cut-off 
frequencies of the base-band signal. 


Vi. DETECTION OF EXPONENTIAL MODULATION 


The exponential modulation law f(z) = e? offers the unique advantage 
of eliminating the need for preliminary in-phase and quadrature de- 
tection of the received single-sideband signal op(t). In this case we 
have 


z= yo(w) = log w (87) 
or using Log to denote the real part of the logarithm, 

x(t) = Log |w(t)| (88) 

y(t) = arg |w(t)}. (89) 


We may regard either x(t) or y(t) as the signal to be recovered. 
The transmitted signal is 


a(t) = Re exp {iw.t + z(t)} (90) 
and 


o(t) +1a(t) = exp {iw,t + z(t)} (91) 
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where a(t) denotes the Hilbert transform of c(t). The envelope of a(t) 
is 


Eio(t)} = |o(t) + 1a (t)| = e* (92) 
The instantaneous phase of o(t) is 
Afo(t)} = arg {o(t) + 1g(t)} = wet + y(t) (93) 


For perfect transmission; i.e., og (t) = o(t), we have 
x(t) = Log E{o(t)} (94) 


and using an ideal discriminator (FM detector) we obtain 


y(t) =< Alolt)} = oe (95) 
Replacing o(t) by or(t) we have 
Xq(t) = Log Efor(t)} = Log |w,(t)| (96) 
? d d 
Ya(t) = ap oR} a ae Arg {wa(t)}. (97) 


Now if w,(7) is zero-free in the uhp, then 
Zq(T) = log Wa(7) (98) 


is analytic in the uhp and by the previous theory the Fourier transforms 
of z(t) and z,(t) agree over (—~,qa). In this case the Fourier transforms 
of x(t) and x(t) agree over (—a,a). Also the Fourier transforms of y,(¢) 
and y’(t) agree over (—a,a). So if the received analytic signal w(t) is 
zero-free in the uhp, x(t) may be recovered by taking the Log of the en- 
velope of the received signal and then filtering with an ideal low-pass 
filter having unity transmission in the band [—Q,Q] and zero transmission 
outside the band [—a,a]. Similarly y’(t) may be recovered by filtering 
the output of an ideal discriminator acting on the received signal. 

Later, in examining the bandwidth requirements of single-sideband 
exponential modulation we give sufficient conditions for w(t) to be 
zero-free in the uhp so that the simple detectors described above may 
be used. 

The simple detectors can always be used with appropriate pre-de- 
tection and post-detection filters, since for sufficiently large b, w,(7 + 
1b) will be zero-free in the uhp. We can give an estimate for b under the 
condition 

sup |z(t)| < m. (99) 
t 


We have 
e-™ < |w(t)| <e”™ (100) 
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and consequently 


e-™ <|w(t+iu)|<e", uo. (101) 
Also, w(r) is zero-free in the uhp and hence [cf. (219)] if 
[w(t +ib) — w(t +ib)| <e-™ (102) 


then w,(r + ib) will be zero-free in the uhp. From (44) we have 


|w(t + ib) — wa(t + ib)| < e~@° sup |w(t) — w,(t)| (103) 
t 


and since 
w(t) = f. e)\K VG =\8)ds (104) 
we have 
sup |w.(t)| as sup |w(t)| 7 Kells Ss e™|lKallr (105) 
t t 
and hence 
sup |w(t) — wa(t)| S {1+ |Kallije™. (106) 
Thus if 
etl + ||Kalli} <e72" (107) 


then (102) will be satisfied. That is, w.(7 + 1b) will be zero-free in the 
uhp for 


,, 2m + log f+ Kull 


Qa 


b (108) 
Thus if (99) is satisfied we may use a frequency-translated Poisson filter 
with parameter b satisfying (108) to obtain opr(t;b) [cf. (66)] and then 
operate on the envelope and phase of or(t;b) as before. Then the ap- 
propriate post-detection filtering may be employed to recover x(t) and 
y(t). 


Vil. NOTE ON THE FACTORIZATION OF CERTAIN POSITIVE FUNCTIONS 


The detection theory for SSBEM has important application to the 
problem of factoring certain positive functions of exponential type, i.e., 
certain positive bandlimited functions. 

Voelcker?* has proposed a scheme for demodulating conventional 
single-sideband signals via envelope detection. Conventional SSBAM is 
characterized by linear modulation; i.e., f(z) = z. There is no bandwidth 
expansion so we assume that the Fourier transform of z(t) vanishes 
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outside [0,0] and the channel is such that the received signal is simply 
the transmitted signal; i.e., 


on(t) = o(t) = Re e'cfz(t). (109) 


The envelope of the received signal is |z(¢)|. Voelcker’s scheme requires 
first that z(7) be zero-free in the uhp in order that z(t) may be recovered 
from |z(t)|. This is insured by requiring Re z(t) = x(t) > 0. We have 


2(t + iu) = f. 2(s)P,(t — s)ds (110) 
and since the Poisson kernel is positive, 


“Pie adeS © (111) 


Rez(t+iu) = f. 


Therefore z(7) is zero-free in the uhp. The function log z(7) is analytic 
in the uhp and with some additional conditions on z(7), e.g., 


lim z(t + iu) = 1, (112) 


the imaginary part of log z(t) can be determined from Log |z(t)| and 
hence z(t) can be recovered from |z(t)|. In particular, if 


x(t)=1+ g(t), (113) 
where 
g(t) > —1 and g(t) belongs to Lp, (1 <p < ~), (114) 


then Log |z(t)| will belong to L, and will therefore have a Hilbert 
transform. A more attractive condition for recovering x(t) is the condi- 
tion 


g(t) > —1 and g(t) of band-pass type. (115) 


Then if (115) is satisfied, the Fourier transform of {z(t) — 1} vanishes 
outside [rQ,Q], where 0 < r < 1, and hence 


w(r) = log z(r) (116) 
is analytic in the uhp and satisfies 
w(t + iu) = log [1 + {z(t + iw) — 1}] 
= Of|z(t + iu) — 1|} = O(e-"24), Uu —> ©, 
Therefore, if (115) is satisfied, the Fourier transform of w(t) vanishes 
over (—~,rQ) and hence the Fourier transforms of Log |z(¢)| and 
arg {z(t)} vanish over (—rQ,rQ). That is, if (115) is satisfied, then the log 


of the envelope of z(t) and the phase (arg) of z(t) are high-pass func- 
tions. 


(117) 
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If either (114) or (115) are satisifed we have 


— ya L° Los lz 
arg ie(Ne elt) =f ha (118) 
and then 
2(t) = |z(t)let*®). (119) 


The practical problem encountered here is in approximating the 
Hilbert transform in (118). The function Log |z(¢)| is not band-limitedt 
and the implementation of (118) requires a filter whose frequency 
characteristic is, formally, 


@ —Iwt a 
Higy= f Spee f SUN Ol ey ere. 490) 


o Tt at 





(-~7 <w<o~) 


These stringent filter requirements can be avoided, for we can, by proper 
application of the previous theory, ignore the frequency content of 
Log |z(t)| outside the band (—a,a) where a > Q. Actually if (114) is 
satisfied we may take a = Q. 

The Hilbert transform problem is simplified if we begin with a filtered 
version of Log |z(t)]|; viz., 


Ae(t) = f. “ Log |2(s)|he(t ~ 8)ds (121) 


where h,(t) is an even real-valued function whose Fourier transform 
satisfies 


h.(o) = f "h(tle'dt=1 for-wsw<a. (122) 


We suppose further that h,(t) is sufficiently smooth to have a Hilbert 
transform h,(t). Then the Hilbert transform of \,(t) is given by 


a(t) = alt) = f° Log |2(s)|fa(t — s)ds (123) 


= f. " os)ha(t — s)ds. 


Then defining 
Walt) = Alt) + igq(t) (124) 
we have 


may f. : log f2(s)ih.(t —s) ds. (125) 


t Unless z(t) = constant. See Theorem 6 in Section IX for a stronger statement. 
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Then according to the previous theory 


z(t) = f. k(t —s) exp {w,(s)}ds (126) 

where k(t) is any function of L; satisfying 
f. k(the-tdt = 1, 0<w<2 (127) 

= 0, w2a, 


In case (115) is satisfied the function h,(t) need satisfy (122) only over 
the intervals (rQ,a) and (—a,—rQ), since the Fourier transform of 
Log |z(¢)| vanishes over (—rQ,rQ). That is, h,(t) can then be chosen so 
that the equivalent Hilbert transform kernel /,,(t) has a Fourier trans- 
form that is more easily approximated (within a linear phase factor 
e—'T) by practical filters. 


Vill. NOTE ON LOGARITHMIC COMPANDING 


Suppose g(t) is a function belonging to L, for some p satisfying 1 < 
p <~ and suppose the Fourier transform of g(t) vanishes outside [—,]. 
Companding functions f are sometimes used to compress the range of 
g(t) for transmission; i.e. f{g(t)} is transmitted rather than g(t). Landau 
and Miranker!!.!2 showed that g(t) (in Lz) can be recovered from the 
bandlimited version of f{g(t)} with suitable conditions on f. The recovery 
is accomplished by an iterative scheme. Here we use the detection theory 
to give an explicit solution to the problem of Landau and Miranker for 
the case 


f(x) = 5 Log (l+x), x>-1. (128) 


Accordingly, we further require g(t) to satisfy 
g(t) >-1. (129) 


The function f(x) given by (128) is not an odd function, as one might 
desire for companding purposes, but is interesting because the recovery 
problem is simple. 

The fact that g(t) is a bandlimited function belonging to Lp for some 
p satisfying 1 < p < ~ implies [cf. (29)] 


_ e sin Q(t — s) 
g(t) = i g(s) Ser ds (130) 


from which one can conclude with the aid of Hélder’s inequality that 


lim g(t) = 0. (131) 


t>+to 
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It follows from (129) and (131) and (128) that 
f{g(t)} belongs to Lp. (132) 


Since g(t) is bounded and belongs to Lp it follows that g(t) belongs to 
Ly for every p’ satisfying p S p’ < ~. Hence f{g(t)} also belongs to Lp. 
for such p’. 

Now we suppose we are given 


_ 2 sin Q(t — s) 
o(t) = f. flg(s)) a (133) 


where the integral is absolutely convergent by Hélder’s inequality. In 

fact (Ref. 16) A(t) belongs to Lp, and therefore has a Hilbert transform. 

Furthermore, since g(t) is bandlimited, its Hilbert transform is given 

by 

1 — cos Q(t — s) 
a(t —s) 


= {Agi Sa pee) a 


Ka(t) = galt) = f. ~ dals) ds (134) 


Defining 
wo(t) = Ag(t) + i¢ga(t) (135) 
we have 
walt)= f- flg(s)IKalt ~ s)ds (136) 
where 
ros (137) 
Unt 


So wa(r) is an entire function which is bounded in the uhp. 
Now {1 + g(t)} is a positive bandlimited function which can be repre- 
sented as (Theorem 7.5.1 with Theorem 6.4.5, Ref. 6) 


1+ g(t) = y(t)y() (138) 


where the Fourier transform of y(t) vanishes outside [—Q/2,Q/2] and y(r) 
is zero-free in the uhp. Then z(¢) defined by 


2(t) = y(t)ei@t/2 (139) 


is a function whose Fourier transform vanishes outside [0,Q] and z(7) 
is zero-free in the uhp. Thus we have 


1+ g(t) = |2(0)[2. (140) 
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We may assume that [cf. (131)] 
lim z(t) = 1. 
tte 
Then z(t) is given by 
z(t) = exp w(t) 
where 
ds 


a(r—s) 





w(r) =< {Log {1 + g(6)} 
and 


lim w(t + iu) = A(t) + ig(t) 
u->0+ 


A(t) = 5 Log (1 + g()} 


g(t) = X(t), the Hilbert transform of X(t). 
We see from (136) and (143) that 


Wa(t) = f. w(s)Ko(t — s)ds 


- f. ” log {z(s)\Ko(t — s)ds. 


Then the Fourier transform of zg(t) defined by 
z(t) = exp {wo(t)} 


(141) 


(142) 


(148) 


(144) 


(145) 


(146) 


(147) 


(148) 


agrees over (—~,Q) with the Fourier transform of z(t). Since {z(t) — 1} 
belongs to Lp, and its Fourier transform vanishes outside [0,Q] we 


have 
z(t)—1= ff feals) oa 
Writing z(t) = x(t) + iy(t) we have 
x(t) =1+ f. “fede cos gals) = as 
y(t) = tic ehals) a gals) aes ds 


Then we have 
g(t) = x2(t) + y(t) — 1. 
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(149) 


(150) 


(151) 


(152) 


Thus the recovery problem is solved by means of the Hilbert transform 
in (134) and the formulas (150)—(152). 

Expressing the solution in terms of the bandlimiting operator Bo, 
defined for h in Ly, 1 < p < ~, byt 


Baht) = f"_h(s) a ds, (153) 
the resulting class of functions denoted by B,(Q), we have 
g(t) = [Bo exp {ro(t) + iXo(t)}|? — 1 (154) 
where \Qg(t) is the given function 
holt) = 5 Bo Log { + 4(6) (155) 


ginB,(Q), g>-l, 


and Xqg is the Hilbert transform of Xo. 

The solution (154) is deceptive in that it suggests that Ag(t) may be 
any function of B,(Q), 1 < p < ~, since g(t) given by (154) is a function 
of Bp(Q) satisfying g > —1. However the solution was obtained on the 
premise that Ag(t) is a given function of the form (155). All functions in 
B,(Q) do not have the representation (155). The crucial point is that the 
function 


z(t) = Bo exp {Ag(t) + iro(t)}, {z(t) -—1Jin Bp(Q), (156) 


whose Fourier transform vanishes outside [0,Q] should extend as a 
function zero-free in the upper half-plane. Then, and only then, ac- 
cording to the general theory, will we have 


Ba log {z(t)} = Ao(t) + ie(t) (157) 


and hence 
1 : 
Bo Log |z(t)| = 5 Pa Log {1 + g(t)} = Ag(t), gin Bp(Q), (158) 


Poi, 


On the other hand, if (158) is known to hold, implying (157), then z(t) 
must necessarily extend as a function zero-free in the upper half- 
plane. 

We state this important result as 


Theorem 4. Given a function X(t) in Bp(Q), for some p satisfying 1 S 


t The operator Bg can be extended to certain other classes of functions. For example, Bo 
is an identity for the constant function, which fact is used in (154). 
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p < ~, the equation (155) has a solution g(t) in the same class Bp(Q) 
satisfying 


g(t) >—-1 
if and only if the function 
z(t) = Bo exp {Ag(t) + idg(t)} 


where Xo(t) is the Hilbert transform of \o(t), extends as a function 
zero-free in the upper half-plane. Then the solution of (155) is given 
by (154). 


IX. BANDWIDTH REQUIREMENTS FOR EXPONENTIAL MODULATION 


We have seen that for a wide class of analytic signals z(t) and modu- 
lation laws f(z) the bandwidth requirement for transmitting f{z(t)} and 
recovering z(t) is Q + « (for any e > 0) where 2 is the bandwidth of z(t), 
provided we allow the use of Poisson filtering at the receiver. In case the 
inverse function z = ¢(w) is an entire function there is no need for 
Poisson filtering. If we look at the overall system design, as contrasted 
to a detection problem, it is reasonable to ask for the bandwidth re- 
quirements for a given f(z) and a fixed receiver, namely the inverse 
function y(w) followed by a low-pass filter, such that we recover all z(t) 
whose Fourier transforms vanish outside [0,] and satisfy some sort of 
norm constraint, say |z(t)| < m. The problem then is to specify a channel 
of finite bandwidth, i.e., a function K,,g(t) in Ly, with 2 < a < 8, satis- 
fying 


Rug(w) = ff. ” Kap(the-i#tdt =1, 0S w<a (159) 
= 0, w> Bp 
such that the received (bandlimited) analytic signal w.4(t), given by 
Wa.g(t) = f. flz(s)\Ka.g(t — s)ds, (160) 
satisfies 
Q{W «,g(7)} analytic in the uhp (161) 


for all z(t) whose Fourier transforms vanish outside [0,Q] and which 
satisfy 


|z(t)| Sm, -o<t<o, (162) 


We assume of course that f(z) is analytic for |z| < m. We would like to 
make the channel bandwidth 6 as small as possible consistent with (161) 
and (162). Clearly, we may take 2 = 1 with no loss in generality. We de- 
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fine the minimum bandwidth Bo(m) as 
Bo(m) = inf B(m) (163) 


where the infimum is over all functions K,,g(t) subject to (159), (161), 
and (162) with Q = 1. In taking the infimum we may allow a = 1. 

The determination of 8o(m) is in general a very difficult problem. We 
give some estimates here for Bo(m) for the case f(z) = e?, g(w) = log w. 
In this case, (161) is satisfied if and only if 


Wa,g(7) is zero-free in the uhp. (164) 
We have 
w(t) = exp{z(t)} (165) 
and with (162) 
e-™ < |w(t)| <e™. (166) 


If wag(t) is sufficiently close to w(t), (164) will be satisfied; i.e., a 
sufficient condition for (164) is 


|w(t) — Wag(t)| <e7™. (167) 


It is intuitively obvious, with the freedom we have in defining Kg, g, 
that for sufficiently large 6 we can find a function K,g(t) such that 
Wa,g(t) given by (160), with f(z) = e?, will satisfy (167). It is important 
to note in this connection that, although the definition of Ky,3(w) for w 
< 0 does not affect wa,g(t), we are free to define K,,g(w) for w < 0 (as well 
as for a < w < 8) in the most favorable way to obtain the estimate 
(167). 

First we obtain lower bounds for Bo(m). 


9.1 Lower bounds for B){m) 


We can obtain a lower bound for 8o(m) by taking 


z(t) = me", (168) 
We have 
, omer . 
w(t) = exp {me#4} = > —— eit, (169) 
r=o0 R! 


Now assume the channel cutoff frequency 6 satisfies 
n<Bsnt+l (170) 


where n 2 1 is an integer. Then 


Wa,g(t) = 1+ met+ > apetrt (171) 
k=2 
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where the a, depend on m and the definition of K «,g() forl<w<n. 
We have 


oe fe 
wap(t) = IT (1 -) (172) 
where ¢ = et and 
nj 
ae = —m. (173) 


We require W,,a(t) to be zero-free in the uhp; i.e., 


lt] 21. (174) 
Thus 
n jd n 1 
m= S| SO aS a (175) 
k=1 =| r=1 | Sp| 





Therefore, we must have 8 > n = m in order for wWa,g(t) to be zero-free 
in the uhp. Then 


Bo(m) > [m]* (176) 


where [m]* is the smallest integer which is not less than m. 


9.2 Lower bound for small m 


We know that 6o(m) > 1 for any m > 0 but (176) does not say how 
much 6o(m) must exceed 1 for 0 < m < 1. For sufficiently small ¢ and 
correspondingly small m we can show that 69(m) > 1 + «. 

For small m, we have 


w(t)=1+2z(t) + a + O(m?). (177) 


Now by Corollary 2 of Theorem 2 the Fourier transform of z2(t) vanishes 
outside [0,2] and we would like to find a z(t) such that a channel filter 
with a sharp cut-off, i.e., 8 = 1 + ¢, acting on a small z°(t) gives a large 
(negative) output at t = 0. For sufficiently small ¢ and fixed m we can 
accomplish this by taking 





it/ 
z(t) —— = : (1 + iS, (€t)} (178) 


ae 


where S, (t) is a sine polynomial, 


S G)= > dessin kt (179) 
k=1 
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with real coefficients a; and 
max |S, (t)| = 1. (180) 
Also we require 


1 
= 181 
ne A (181) 


so that the Fourier transform of z(£) given by (178) vanishes outside [0,1]. 
Also we have 


max |z(t)| =m (182) 
and 





it/2 
i {1 + iS, (et)} 


Te 
+ el — S2(et) + 2S, (et)} (183) 


w(t)=1- 


+ R3(t), 


where 

R(t) = 

a(t) 2X, k! 
1 





(184) 


3 
|R3(t)| < ————_,_ (m <4). (185) 
0-3) 
4 


We have 





it/2 
me {1+ S,, (et) 


walt) = J wOKaalt — dé = 1-7 


2 <o 
ae J... eif{1 — S2(cE) + 2i8,(cé)\Kag(t Ede (186) 


+f) RalOKaplt ~ ade 


and We,g(t) is a polynomial of degree 2n in exp (iet) where « = 1/2n. We 
have 


2ie'tS,, (et) = —ett 3 a,e ket (187) 

k=1 

. n e 

+ elt > apetket 
k=1 
. . 2n . 

etS2(et) =e! YD bpetket, (188) 

k=-2n 
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So 





meit/2 ; 
wa,a(t) = 1 — [1 + iS) 
pele {1 — pit > bpeiket — eit > aye itet} (189) 
4 k=-2n k=1 
+ rs(t) 
where 
rat) = RalOKaalt ~ £48, (190) 


Since R(t) is a periodic function of the form > c,e~'*«t we may take 
for Kq,g(t) any function of L; whose Fourier transform satisfies 
Kag(w) =], O<w<l 

= 0, w>BrHlte 
It is shown in Appendix B that there exists a function K,,(t) whose 
Fourier transform satisfies (191) with 


(191) 


f. |Kag(t)|dt <1+ igs (a + -): (192) 
—o W 3e 
Thus 
[ra(t)| < max |Ra(t)| f |Kq,g(t)|dt (193) 
3 
<—t [3 fgg (1 +3]: 
T 3€ 


“(0-9 


We have from (189) 


jet G'S eee (194 
Wa,a0) = V2 4 | cs x an} ) 
+ r3(0). 

Now S2(t) is an even function and S,,(0) = 0. Thus 
2n 
os bp = 0, b_p = bp (195) 
kR=-2n 
-1 2n —b 
Y be= > b= (196) 
k=-2n k=1 2 
0 b 
~ b== (197) 
k=—-2n 2 
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and 
1 T 7 1 n , 
a ff. S2t)dt == 4 a2 <1. (198) 
on —7r 2 k=1 


Now we may take S,,(¢) to be an approximation to sgn{sin ¢}. In particular 
we may take S,,(t) to be the function found by Szeg6!9 which maximizes 
1 ap subject to (180). He gives the inequality 





n 2 
> an, <M, = > cot (5) ~-logn, ne, (199) 
1 ra 


n+1lig<ke<n 2(n + 1) 
k odd 


Here we should not identify a; with the terms in the second sum. For 
equality in (199) we must have 





s, (42) =1, 1<k<n, kodd. (200) 
n+1 


Equality in (199) is attained for 


S(t= ¥ [in (¢- 75) - In (t+ ud )} (201) 











1<k<n n+1 
k odd 
where 
nti 2 
sin t 
Lt). = (202) 
t 
+ 1) sin - 
(n ) sin 5 
It is easy to show that 
Se (pe "Vag, Bavacead 203 
k=—-n n( a) pene ( ) 
(k odd) 





bs I (¢- sas $i CC 9) ea) 204 
k=-(n-1) " n+ 7 a “ . m)} (204) 
(k odd) 


=I: for n even 


It follows from (203), (204), and (201) that S,(t) given by (201) satis- 
fies 


—-1<S,(t) <1. (205) 
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From (201) and (202) we find that 

















. okt 
sin? — 
4 k 2 n odd 
a, = = sear cae 
: it a) ke kh =1,2,...,n (206) 
sin 
n+1 
: nkx \2 
4 k (sin 2(n + i n even 
a, =—— (1- | en (207) 
n+1 n+1 *) tbe k=1,2,...,n 
sin 
n+l 
It is shown in Appendix C that M,, given by (199) satisfies 
2 
M, >-—logn 22 (log = + 7) (208) 
Tv Tv T 
where 
y = 0.5772... (Euler’s constant). (209) 


So with S,(t) given by (201) and z(t) given by (178) we have from 
(193), (194), (197), and (208) 


2 
weg(0) <1~ Tet" (1 — 2-2 ogn ~= (log~ +7) } 


+—™ [1 + tog (i+ >)| (210) 


m v 
3! Gas) 
4 
1 


1 
€ 2n 


where 1 <a < 8, 


bo=5- [ Skedt~1asn— 
Qar —T 


Now if we set 


2 
it logn =1 (211) 
Qa 
it is clear that 
Wa,e(0) < 0 for sufficiently large n. (212) 
Since 
2n P 
Wa,e(t) = Y° cpetht/2n (213) 
k=0 
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where the cy are real and co > 0, we have w,,g(iu) real and 


lim Wa,g(iu) = co > 0. (214) 


uo 


So (212) and (214) imply that w.,4(t) has at least one zero on the positive 
imaginary axis; i.e. for sufficiently large n, and m given by (211), we must 
have 6 > 1 + 1/2n (to pick up at least another harmonic) in order for 
W.a,g(t) to be zero free in the uhp. Thus 


On 
Be ( ia >1+ - for sufficiently large n. (215) 


In connection with obtaining lower bounds for fo the idea comes to 
mind that we might be able to find a z(t) satisfying |z(t)| < m (for suf- 
ficiently large m) such that the Fourier transform of w(t) would vanish 
over a large interval (1,8). Then if w. g(t) were not zero free in the uhp 
we would have fo > 8(m). The idea is to obtain a wa,g(t) which would 
be independent of the choice of K,,g(t). However, we cannot make the 
Fourier transform of w(t) vanish over large intervals unless z(t) = con- 
stant. 


Theorem 5. Suppose z(t) is a bounded continuous function whose 
Fourier transform vanishes outside |[0,Q|, and suppose that the Fourier 
transform of w(t), where 


w(t) = exp {z(¢)}, 
vanishes over (a,b) where 
a20 and b-a>Q. 
Then 
z(t) = constant. 
A similar result holds for the logarithmic function. 


Theorem 6. Suppose z(t) is a bounded continuous function whose 
Fourier transform vanishes outside [0,Q] and suppose its analytic 
continuation z(t) satisfies 


|z(t+iu)]=>e>0O foru20 
—-ex<t<om. 


Suppose further that the Fourier transform of w(t), where 
w(t) = log {z(¢)} 
vanishes over (a,b) where 


az0, b-a>Q. 
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Then 
z(t) = constant. 
As an application of the last theorem, we may take 2 = n and 


z(t) = [] (1— get?) 
k=1 


Then, assuming |\;| <1, we have 


Umeimt 





log z(t) = — 3 
m=1 m 


where 
Lm = > (Ap )™ 
k=1 


Then the following is true. 


Corollary. If {Xp}, k = 1,2,...,n, is any set of n complex numbers and 
n 
~ Oxy)" = Oform=p,p+1,p+2,...p+n-1 
k=1 
where p is a positive integer, then hy = 0,k = 1,2,...,n. 


Proofs of Theorems 5 and 6 are given in Appendices D and E. 


9.3 Upper bound for B o(m) 


We have 
w(t)=e? and |z(t)| <m (216) 
so {w(t)}—! is bounded and analytic in the uhp. Thus the quotient 
Wa,p(t) _ 1+ we g(t) — w(t) O17 
w(t) w(t) a 


is bounded and analytic in the uhp, and is reproduced by the Poisson 
kernel from its values on the real line. Then if 


nO ue) 2ifirve<t<e (218) 


the function w,,g(t) is necessarily zero-free in the uhp. Thus a sufficient 
condition for wa,g(t) to be zero-free in the uhp is 


|w(t) — Wa,g(t)| <e7™. (219) 


For lack of something better we will use this condition to obtain an upper 
bound for 8o(m). To meet this condition for large m will require con- 
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siderably more bandwidth than the lower bound for 8o(m). In fact for 
the case z(t) = me? we have 


o mk | 
w(t) = > —eikt (220) 
r=0 R! 
and 
: [8] a 
Wap (t) =1+met+). a,etkt (221) 
2 
where the a; depend on the definition of K,,g(w) for w > 1. Since 
k ra 
= an ff. {w(t) —waglt)jje**dt, k= Bp (222) 
k! Qa —T , 
we have 
mk 
ht < max |w(t) —wag(t)| wherek 2 8. (223) 


Hence in order to satisfy (219) for the case z(t) = me** we must have 


max 


tir 
k2=B 


a <e-™, (224) 


For large m we must have £ large since 


k! < V2ark keke l/ 12k, (225) 
We find for large m that 


B>pm+o(m) (226) 
where p is the root of 
p =eltl/o = 3.591121477. 


Thus for large m the upper bound we obtain for 89(m) from the condition 
(219) must be something like 3.6 times as large as the lower bound (~m) 
we obtained previously. We can in fact obtain an upper bound for Bo(m) 
that is close to pm for large m and, as it turns out, is close to the lower 
bound for Bo(m) as m — 0. 

To do this we suppose that 


a=n<8 (227) 
where n is a positive integer. We take 


Kup(w) =1, 0<w<n 





2-8 n<wse (228) 
Bn 
= 0, w> Bp. 
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K,.g(w) is defined for w < 0 in such a way (see Appendix B) that 


= 1 4n 
f. |Kq,e(t)|dt “leg hee | ca (229) 
Now we write 
w(t) = e?) = P,{z(t)} + > le(t)y* (230) 
k=nt+l1 k} 
where 
nm fe(t) (231) 





Pr lz(t)} = Pa hI 


Since by Corollary 2 the Fourier transform of P,,{z(t)} vanishes outside 
[0,n], we have 





Weg(t) = f. - w(s)Ko,g(t — s)ds (232) 
= P,{z(t)} + Rnsilt) 
where 
Rnsilt) = fi [= | Kuslt- sas (233) 
Fo <{ 5 f” KaoWlat (234) 
Thus 


a ak eo 
Jot) — wae] <{ 3 ieee IK,o(¢)|dt} (235) 
Therefore if 
id a |Ka,p(t)|dt <———> = Qa(m) (236) 
k=n+1 i . 


Wa,e(t) will be zero-free in the uhp. Then from (229) and (236) we can 
get an upper bound on @ for each choice of n. Since 


ff. " Ky g(t)e-i#tdt = 1 for0<w<a (237) 
it follows that 


i ” |Ka g(t)|dt > 1. (238) 
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So the inequality (236) can hold only if m and n are such that the 
right-hand member of (236) is greater than 2. Then setting 


An (m) = max {0,[Qn(m) mi 2]} (239) 
we see from (229) and (236) that 


1 4n 
—log j1+ Tear = A,(m) (240) 
v 3[Bn (m) — n| 
defines for each positive integer n an upper bound for 8o(m), viz., 
4 
—n 
3 


Bn(m) =n+ (241) 


etAn(m) — 1° 


We take 6,,(m) = © for A,(m) = 0. For fixed m, we have A, (m) > 0 for 
sufficiently large n. Then 


Bo(m) < B(m) =min B,(m)_ n= 1,2,3,... (242) 

Since 8,(m) > n, it is clear that the minimum in (242) will be 8,;(m) 
for sufficiently small m. We have 

Ax(m) = 2m-2+ O(m-}),. m0. (243) 


So the upper bound 8;(m) for small m compares favorably with the lower 
bound (215). At least we have the dominant exponential behavior pinned 
down as m — 0; i.e., 


lim m2? log {89(m) — 1} = —2z. (244) 


m—>0 
The function A,,(m) defined in (239) behaves like (n + 1)!/m"*t! as m 
— 0 and decreases to zero at m = m, where 


(n + 1) 1 
+ 
p 2(1 + p) 





mM 
~ 


n 








1 1 p-1 
1 +1) +— }- log 27 +1 
of (n +1) ree og 2x + log | 


(245) 
and p = 3.591121477 is defined in (226), 


: = .278464543 
p 


1 
2(1 + p) 





= .1089058525 





1 aaa F 
oS F jog oe toe” | = —,0219080253. 
1+p (2 2p 
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Bim) 





Fig. i—Upper bound for transmission bandwidth required in ESSB for simple detection 
of signals z(t) whose Fourier transforms vanish outside [0,1] and satisfy |z(t)| < m. 


The behavior of A,,(m) is such that 8,,(m) defined in (241) for 0 < m < 
My is very close to n over most of this range and increases suddenly as 
m — my. Consequently, the upper bound B(m) defined in (242) is 
roughly a staircase function as shown in Figure 1. For 0 < m < .62 we 
have B(m) = 8;(m). In Figure 2 a graph of logi9 {B(m) — 1} is plotted for 
48 < m S .62. It is seen that only .1% increase in transmission bandwidth 
is required for m < .48, and 10% increase suffices for m < .57. We know 
that 8o9(m) > 2 for m > 1, so without Poisson filtering SSBEM is inter- 
esting perhaps only for |z| < .6. 


X. A POLYNOMIAL PROBLEM 
To each polynomial 


n 
P,()) = 1+ Y aps (246) 
k=1 
we can assign a positive integer v which is the smallest integer =n such 
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logio {B(m}-1] 





Fig. 2—Logarithmic expansion of Fig. 1 for small m. 


that for some choice of a; fork =n +1,n+ 2,..., v, the polynomial 
n v 
PAO ey, Oe Sa (247) 
k=1 n+1 


is zero free for || < 1. The integer v is some complicated function of the 
coefficients a;,, k = 1,2,...n. The fact that v is finite is a rather re- 
markable fact that follows from the previous theory. To see this we 
set 


f=ret, r>0. (248) 


For sufficiently small r we have P,,(re‘) zero-free in the uhp. Then 
taking 


Q(re't) = log P,, (re**) (249) 


= 5 bprkeikt 
1 


ANALYTIC MODULATION SYSTEMS = 535 


and 


Qn (reit) = = byrteikt (250) 
we have 
exp {Q, (e)} = 1+ ¥ cpetkt (251) 
where 
Ch =Qp, k=1,2,...,n. (252) 


Now we identify Q, (e‘“/") with z(t) in the previous section where we 
obtained upper bounds on 6o(m). We know we can bandlimit exp 
{Q,(e#)} to obtain a function of the form 


n . N : 
L+ > cre? + > dzpei*t 
1 n+1 


which for sufficiently large N is zero free in the uhp. In particular, we 
have shown that this is possible for 


N < nB(m) (2538) 
where B(m) is defined in (242) and 


m = max|Q, (e!“)|. (254) 
t 
Thus 
vy<N<nB(m). (255) 
Given the az, or equivalently the bz, fork = 1,2,...,n, we are inter- 
ested in obtaining a lower bound for ». 
Writing 
Pr (O = iu (1 — Ars) (256) 
where |Az| < 1 
we have 
log Prd) = ¥ bas, |] <1 (257) 
where 
b= —- = Oe (258) 
k j=1 
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Since |A,| < 1 we have |b;| < v/k and hence 


ype max label, ee Oe onthe (259) 
Consider, for example, the case 
P,(9 =1+mg., m20 (260) 
Py (SPL tm + Gai Fc a: (261) 
We have 
log Priv (Q = mo" + bng tht... (262) 


So for the case (260) we have 
p=2=nm. (263) 


This inequality is clearly best possible in case m is a positive integer. Now 
suppose m = 1 + e, where 0 < « < 1/n. Since p is an integer we conclude 
from (263) that vy = n + 1. However, we can show (see Appendix F) by 
another method that m > 1 in (260) implies vy = 2n. Then it is apparent 
from the example 


Pron() =1+mor+ ON eal 


(with appropriate choice of ao,) that vy = 2n for 1 <m < 2. It is conject- 
ured that this large jump in y at m = 1 also occurs at all integer values 
of m, but we have not been able to show, for example, that m > 2 implies 
p= 3n. 

In order to improve the lower bounds obtained for 8o(m) we are in- 
terested in maximizing the ratio y/n subject to the constraint 


n . 
by + > bpet*t 
rfl 


max <m. (264) 
t 








For any choice of bz, k = 1,2,...n, we are free to choose b, so as to 
minimize the maximum modulus of the sum. That is, in the bandwidth 
problem of the previous section we take 2 = n and 


2(t)= y. bpeikt (265) 
k=0 


with the constraint (257). Then assuming n <a <n+1l,andvy<6<S» 
+ 1, we have 


n , v . 
waglt)=e% 11+ So apettet axer™| (266) 
k=1 k=n+1 


=ebo T] (1—dyet), Jrx| <1. 
k=1 
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Now we want lower bounds on » implied by (258), subject to (264), giving 
us 89(m) > v/n. For a given n (large) we would like to choose the bz so 
as to maximize v. For a particular choice of by we can in principle de- 
termine the minimum value of v required to satisfy (258). In order to do 
this we may assign values to the (n — vy) coefficients az,k =n+1,..., 
v, in (264) and see if it is possible to make |d;,| < 1 for k = 1,2,...,v. 
Recall that the first n coefficients are determined by 


n . . . 
exp | DS beitt| =l1+aje4¥4+...t+a,e™+... (267) 
k=1 


Perhaps a computer study could shed some light on this very difficult 
and challenging problem. 


APPENDIX A 
Proofs of Theorems 1, 2, and 3 


In view of Ex. 2 in Sec. 2.2 it is sufficient to prove Theorem 1 for the 
interval (—~,0). So we assume first that g(t) is a bounded function whose 
Fourier transform vanishes over (—~,0) and we wish to show that g(t) 
is the boundary value of a function bounded and analytic in the upper 
half-plane. For this purpose we define 


gult) = f. g(s)P,(t —s)ds, u>0 (268) 
where 
P,(t) = eer (269) 
We have 
len@ sf le()IIPule - s) las. 
Hence 
sup lgu(t)| < ib lg(t)|. (270) 
Also 
ion £u(t) = g(t) for almost all t. (271) 


Now all we have to show is that g,,(t) is an analytic function of 7 = t + 
iu. Since the Fourier transform of g(t) vanishes over (—~,0) we may 
replace P,,(¢) in (268) by any function of L; whose Fourier transform 
agrees (for each u > 0) over (0,~) with that of P,,(t). In particular, we 
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may replace P,,(t) by the analytic kernel, 
} 1 4 ; 
K(t + iu) = = f Flweittinds, u>0O (272) 
viv —a 


where F(w) =1 for w20 
=wt+l1 for —-l1<w<0 
=0 for w<-l. 
The definition of F(w) for w < 0 is important only to the extent that the 
integral in (272) converges for u > 0 and implies 


f. |K(t +iu)|dt<o for u>0O. (273) 


It is sufficient for (273) that e~°“F'(w) belong to Ly and have a derivative 
in L» [see (5)]. . 
Thus we have 


g,(t) = i ie g(s)K(t+iu—s)ds=G(t+iu) (274) 
vie —a 
where G(z7) is analytic in the uhp and from (270) and (271) 


|G(r)| S sup |g(¢)| (275) 
t 
lim G(t + iu) = g(t) for almost all t. (276) 
u—0 
This proves the first half of the theorem and we may as well write 
G(t + iu) = g(t + iu) = g(t). (277) 


Now for the second half of the theorem we wish to establish that if g(7) 
is bounded and analytic in the uhp, then 


f. ” g(t)h(—t)dt = 0 (278) 


for all functions h of L, whose Fourier transforms vanish over (0,~), or 
equivalently 


f ” g(t)h(t)dt = 0 (279) 


for all functions h of L,; whose Fourier transforms vanish over (—~,0). 
To do this we need some lemmas concerning analytic functions belonging 
to L, on lines parallel to the real axis. 


Lemma 1. If h(t) belongs to Ly and its Fourier transform vanishes over 
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(—~,0) then h(t) has an analytic continuation h(t + iu) in the upper 
half-plane u > 0 satisfying 


f. |n(t + iu)|dt < f. |h(t)|dt. (280) 
Proof. We have 
h(t) = J, * h(wei*tde (281) 
with the Fourier integral providing the analytic continuation 
A(e+iu) =f R(werde, u>0. (282) 
Since A(w) = 0 for w < 0 we may write 
h(t + iu) = f. “ A(w)en#llede, wu >0 (283) 
and hence conclude that 
h(t + iu) = f. h(s)P,(t — s)ds (284) 


where P,,(t) is the Poisson kernel defined in (269). Then (280) follows 
from (284), since the L; norm of the Poisson kernel is 1 for every u > 
0. 


Lemma 2. Suppose h(r), r = t + iu, is analytic in the strip0 <u <b 
and satisfies 


f. |n(t + iu)|dt< for 0<u<b. (285) 
‘Then 
h(t + iu) = : Re A(weittinds, Osu<b (286) 
where 
fi(w) = f. _ h(te-tetde. (287) 
Proof: Defining 
h(w,u) = f “h(t + iuje#!dt, OSu<b (288) 
we wish to show that 
A(w;u) = e- “Uh (w). (289) 
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We have 


ACC eee ee ff. ‘ (1-121) iemererae, (290) 


Q—> 2 Qa 


Also, from the analyticity of h(r), we have 
oes ean. 0<u<b. (291) 
ot lL Ou 

We would like to establish (289) from (291) by differentiating inside the 


integral of (290) but at this point we do not know enough about A(w;u) 
to justify the differentiation. Therefore, we will define 


g(t + iu) = f. ” k(s)h(t + iu — s)ds (292) 
where k(t) is a function of L, whose Fourier transform k(w) does not 
vanish for any argument and such that wk(w) belongs to L;. We would 


also like k’(t) to belong to L;. We may take 


k(t) = 





- e~ 07/2, (293) 


T 


Then g(r) is analytic in the strip and 
f. |g(t + iu)|dt < f. |n(t + iu) de. (294) 
We have 
1 re ; 
g(t + iu) = 5 ff hw)h(wu)etde, O<u<b, (295) 
T —a 


and since wk(w) is in Ly, 


6) (ata) 1 f 
- ne ee 
aL” Qa 


 iob(w\h(wu)etde, O<Su<b. (296) 


Now 
} r) , 
g’y (t) =~ g(t + w) =—— g(t + iv) (297) 
ot Lou 
belongs to L; for 0 <u <b since 
g’, (t) = f ” h(s + iw)k(t — s)ds (298) 
and k’ belongs to L;. Hence the function of t 
ait + iu) 
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has a Fourier transform for 0 < u < b. Thus from (295) 
a2 
Ou 


Since k(w) % 0 for —~ <w < ~, we conclude from (296), (297), and (299) 
that 


g(t tiu) = : f_ R(w) ~ h(wiudeietde, O0<u<b. (299) 


ae: h(w;u) = iwh(w;u). (300) 
lL Ou 


Then (289) follows from (300). 
A corollary of Lemma 2 is the following 


Corollary. If g(r), 7 = t + iu, is analytic in the strip a <u < band 
satisfies 


fe lg(ttiu)|dt<o for a<u<b, (301) 
then 
J. : g(t +iu)dt = constant, for a<u<0b. (302) 


The corollary follows by applying Lemma 2 to the function g(t + ia + 
te) for arbitrarily small positive e. 


Lemma 3. If g(7), 7 = t + iu, is analytic in the upper half plane u > 0 
and satisfies 


fe lg(t+iu)|dt<© for uZ0 
then the asymptotic estimate 
f. lg(t + iu)|dt = Ofe-*4} as u>om (303) 
implies 
f. g(t +iw)|dt < e-e f. lg(t)|dt for uw=O (804) 
and 
g(t +iu) = f ' Bweittindy for u>O (305) 
where 


8(w) = f. i e-ivtg(t)dt=0 for w<a. (306) 


542 THE BELL SYSTEM TECHNICAL JOURNAL, MARCH 1978 


Proof: From Lemma 2 we have the representation (305) so that 


(wee = f ” g(t +iuje-ietdt, (307) 
Thus 
|g(w)e-#"| < fe le(t + iu)|de. (308) 
Then (303) and (308) imply 
&(w) =0 for w<a (309) 
and since £(w) is continuous, (309) implies 
&(w) =0 for w<X<a. (310) 
Thus we may write 
rie a ee We e-“lo-ala(weitdt, (311) 
. or —a 
Then since 
1 © ; P 
aorite f. e—U|o—alpivtdt = eiatP (t) (312) 
Qr —0 


where P,,(t) is the Poisson kernel of (269), we have 
g(t + iu) = e-% f ” g(s)eialt-s) P, (t — s)ds (313) 


and (304) follows from (313), and Lemma 3 is proved. 


Now we are prepared to prove the second half of the Paley-Wiener 
Theorem for L... We have g(7) analytic for u > 0 and 


sup |g(¢ +iu)| <M for u=0. (314) 
t 
Now suppose h(t) is any function of L, whose Fourier transform vanishes 


over (—~,0). We have from Lemma 1 that h(t) is the boundary value of 
a function h(7) analytic in the upper half-plane u > 0, satisfying 


f. |n(t + iw)dt < f.. |n(t)|dt, w=. (315) 
Now we consider the function 


f(t) = g(7)h(7) (316) 


which is analytic in the uhp and satisfies 


os Ife + iu)|de <M fT |n(t)|de. (317) 
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Then from Lemma 3 

f. 7 f(tje-*tdt =0 for <0. (318) 
In particular 

f. - f(t)dt = f. ; g(t)h(t)dt = 0 (319) 


which completes the proof of the first (one-sided) Paley-Wiener Theo- 
rem. 


Theorem 2. It is sufficient to prove the two-sided Paley-Wiener Theorem 
for functions g(t) whose Fourier transforms vanish outside [—a,a]. We 
show that g(t) is the boundary value of an entire function of exponential 
type by defining 


ue —ta(t—s) 


Gilt + iu) = f. ; alse ad > O (320) 
Golt +iu) = fr g(sje-eu le gs <0. (321) 
= a{(t — s)? + u} 
We have 
|Gy(t + iu)| < e sup lg(t)|, u>O (322) 
|Go(t + iu)| < e~% sup |g(t)|, u<0O (323) 
and since g(t) is continuous 
a Gy(t+iu) = g(t) forallt (324) 
bon Go(t+iu) =g(t) forallé. (325) 
Now we define 
Ga(r) = f. _K(r—s)g(s)ds (326) 


where 
1 o . . 
Ko2 = ff. R weird 
27 J—« 


544 THE BELL SYSTEM TECHNICAL JOURNAL, MARCH 1978 


and 


K(w)=1, -as<w<a 


=2(1->  aswet% 
2a 
=0, w22a 
K(-w) = K(w). (327) 


Then K(r) is an entire function belonging to L; along each line parallel 
to the real axis. So G3(7) is an entire function bounded on each line 
parallel to the real axis. Since the Fourier transform of g(t) vanishes 
outside [—a,a] we may replace the convolution kernels in (320) and (321) 
by K(t + iu) since in each case their Fourier transforms agree over 
[—a,a]. 

Thus 


Gi(t +iu) = G3(t tiu), u>O (328) 
Go(t + iu) = G3(t + iu), u<0. (329) 


Hence g(t) is the restriction to the real line of an entire function G3(t + 
lu) = g(t + iu) satisfying 


[g(t + iu)| < ell sup |g(t)|. (330) 
t 


Now for the second half of the theorem we suppose that g(7) is an entire 
function satisfying 


sup [g(t t+iu)| < ealu| sup |e(t)| (331) 

and wish to conclude that 
f __glt)h(t)dt = 0 (332) 
for all functions h in L, whose Fourier transforms vanish over (—a,a) 


(and hence over [—a,a]). From the one-sided Paley-Wiener Theorem 
we have 


ff ” g(t)hy(t)dt = 0 (333) 


for all functions h, in L, whose Fourier transforms are supported on 
(—~,—a) and 


f ” g(t)ho(t)dt = 0 (334) 
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for all function hz in L, whose Fourier transforms are supported on (a,~). 
The difficulty we encounter in establishing (332) is that an arbitrary 
function h in L; whose Fourier transform vanishes over [—a,a] cannot 
be decomposed as 


h(t) = h_(t) + hi(t) (335) 


where h_ and hz belong to Ly, and the Fourier transform of h- is sup- 
ported on (—~,—a), and the Fourier transform of h+ is supported on 
(a,o). In order to deduce (332) from (333) and (334) we have to ap- 
proximate the test function h in (332) with bandlimited functions hy. 
We may take 





h(t) = f " bK(bs)h(t —s)ds, b>O (836) 
where 
at 
9 3m aoe 
=< 337 
K() == (337) 
Since 
§__ KWat = f. |K(t)|dt = b f |K(bt)|dt.= 1, 
we have 
f. |hy(t)|dt < ee |n(t)| de. (338) 


Also we may write 
h(t) — ho (t) = f “ 6K (bs){he(t) — h(t — s)}ds (339) 
which gives 
f. i |n(t) — hp (t)|dt < f. * BK (bs)ua(ssh)ds (340) 
where 
yi(s3h) = f. “ [a(t) = h(t = s)|ds. (341) 


The function p(s) is called the modulus of continuity of h. It is an even, 
continuous, bounded function of s (see Ref. 3) and 


41(0;h) = 0 


yi(ssh) <2 f |n(t)|dt = 2ilrlly. 


(342) 
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Then writing 


ff. ” BK (bs)an(ssh)ds = f "Kia (= :i) dt (343) 


vb t 
< f Kt) (53h) dt + 2\lnll, f K(t)dt 
-—Vb b |t|>Vb 


it is clear that given « > 0 we can choose b so large (a < b < ~) that 
f. |h(t) — hy(t)|dt <e. (344) 
Now the Fourier transform of h, is supported on the intervals (—b,—a) 
and (a,b), so hp, does have the decomposition 
hp(t) = h_(t) + he (t) (345) 


desired in (335). This follows from the existencet of a function Kg, (t) 
in L, whose Fourier transform satisfies 


Kapow) = 1, axsw<x<b 


=0, wX-a (346) 
so that 
h4(t) = f. ” ho(s)Kap(t — s)ds (347) 
and 
hella < Holla - Kaolla. (348) 
We have 
h_(t) = hp(t) — h(t). (349) 


So h_— also belongs to Ly. 
Returning to (332) we have 


f_e@nae= fT soho (de+ f~ glth(e) — ha(e)ide, 


(350) 


Since 
J e@)ho(edt =f g(oh_@ae + f_e@hs Ode (851) 
=0+0 
~ +See Appendix B for a good choice of Ky, (t). 
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we have 
| {er @at| <sup gO] [™ |e) - ho(eolae 


<e sup |g(t)|. (352) 


Since we may choose b sufficiently large to make « arbitrarily small we 
conclude that 


f. ” a(t)h(t)dt = 0 (353) 


for all h in L, whose Fourier transforms vanish over (—a,a). This com- 
pletes the proof of the two-sided Paley-Wiener Theorem. 


Theorem 3. Here we wish to show that if g(7) is analytic in the uhp and 
bounded on the real line as well as every line parallel to the real axis in 
the uhp, then the asymptotic estimate 
sup |g(t + iu)| = Ofe-¢4} asu > @ (354) 
t 
implies 


sup |g(t + iu)| < e-¢ sup |g(t)|,  u = 0. (355) 
t t 


Now if x is any real number and y is any positive number, the func- 
tion 


fg(r)e— #97 — g(x + iy)e iat} 


_» 
eee (r—x)2 + y2 


(356) 


where we think of x and y fixed, is analytic in the upper half-plane u > 
0 and satisfies 


f. |n(t + iu)Jdt <<, uZ=O0 (357) 
and 
f. |h(t +iu)|dt =O(1) asu— . (358) 
It follows from Lemma 3 that 


ff. ” A(t)dt = 0. (359) 
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Hence 


—latd . ; 
2 an Oe ree ae = g(x + Ly)e~tatiy), (360) 
Thus 
|g(x + iy)| < e~2 sup |g(t)]. (361) 
t 


This inequality holds for any —~ <x < ~ and any y > 0. Then (355) 
follows from (361), if we note that (355) holds trivially for u = 0. 


APPENDIX B 
Reproducing Kernels of Small L, Norm 


We would like to find a kernel K.g(t) of minimum L; norm whose 
Fourier transform satisfies 


K.g(w) = 1, O<Swk<a 
= 0, w> (362) 


where 0 <a < 8. 

Replacing K,,g(t) by 1/aK.,g(t/a) we see that the minimum norm 
is a function of @/a, or if we like, a function of a/(8 — a). It is sufficient 
to consider functions K)(t) whose Fourier transforms satisfy 


K)(w) = 0, w<0 (363) 
=1, lSw<1+t+A 
where we make the identification 


Qa 
A= . 364 

Pan (364) 

We will not treat the minimization problem here. Instead we give a 

construction for a particular function K(w) which can be shown? to be 

the solution for the case \ = n,n = 1,2,3,.... The construction provides 

an interpolation between the minimal norm values in casen <A <n + 
1. 





We write 
A=n+0 (365) 


where n = [A] is the largest integer contained in A and 0 < 6 <1. Then 
we set 


Ky(t) = 2af{F(t)}? (366) 
t The details will be given in a future paper. 
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where the Fourier transform of F(t) vanishes for negative argument, 


and otherwise is defined by 


Fy\(w) =apn,, ke w<kt+1<nt1 


=Qn41 Nt+t1lsw<nt+14+8 
=0, w2nt+1+0=dAtl1 
where the a; are defined by 


1 feo} 
= > apz? 
V1i-z x . 








1.€., 
— (1/2)n _ ()G*)G*2) _ (G+ 5 1) 
pane ae k! 
; r (5+) 
r (3) r(1 +k) 
We then have 


Rylw) = f, ” B(x) Pw — x)dx 
which is a piecewise linear function satisfying 
m-1 
K\(m) = x Gpam—1-p = 1 


form = 1,2,...,n +1. Thus 


K\(w)=1 for 1<swsntl1 
=qw for 0O<w<X1. 


(367) 


(368) 


(369) 


(370) 


(371) 


(372) 


Forn +15 0<n+1+6 the convolution in (370) is independent of the 


definition of F(x) for x >n+1+ 6; ile. 


K\(@) = Knsi(w) for OSwSnt+1t98. 


So 


K\@) =1, nt1<w<nt1+t0. 


Thus 


Ry\()=1 for 1<w<n+1+06 


=qw for O0O<w<1. 
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(373) 


(374) 


(375) 


We have 


wOry= fi [Ky (t)|dt = 20 f. |F\(t)|2dt 
- fe [Fy (w)|2de 


2+ Oaz.s (376) 


oMes 
Q 


which is a piecewise linear function of X, i.e., 
(A) = (n+1—A)w(n) + (A—n)u(n +1), nSASn+1. (877) 
We have 
89 


5 
u(0) = 1, BOR: u(2) = 6A (378) 


The remainder of this appendix is asvoisd to estimating y(n) for large 
n. We need an upper bound. 
For convenience we set 


1 
i Cie a3 
ES Beit og) 


and then 
1 in 
u(n) =— SS y?(k). (380) 


First we estimate y(x). We have the representation for the Beta 
function 


P(x)PQ) _ 


1 
ty-1 1 —t)*-!dt > 
T(x + y) if ( ) (Rex >0,Rey>0). (381) 


Then 
ft 1a1- Get. 


(x dt 382 
v(x) = We Ni ee Top (382) 
Setting 1 — t = e~* we obtain 
e ~s(x+1/2) 
x ds 
1S ae el ae V1—-e§ 
1 © —s(x+1/4) 
< (383) 


=== —————-d 
Va Jo V2sinh s/2 : 
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Since 
l-e-F<s 
and 2 sinh s/2 > s we obtain from (383) 


(384) 





We can obtain a simple upper bound for u(n) from (384). We have 


ipa Ss 2b) 
WT kR=1 


1 2 1 
<1+- >. —- (385) 
Tk=1 1 
k+- 
4 
Since t=! is convex, we have 
T+1 dt 1 
ff ee (386) 
T t 1 
T+ 
2 


and thus 


1 +3/4 1 | A 
pin) <1+= f B= 1+2tog (1+), (n>=1). (387) 
WT JS3/4 t T 3 
Since log (1 + 44/3) is a concave function of \ and since (A) is piecewise 
linear between integers, we conclude from (387) that 


iQ) 21a Ios (1+), >0. (388) 
Tv 


A sharper estimate of u(A) for large \ is obtained as follows. We are 
interested in the constant term in the asymptotic expansion. 
From (383) we have 


1 1 @. @—*%ds 
a) ee —_——.. 389 
v (3 5) Va Jo V1i—e- — 
Then from the convolution theorem for Laplace transforms, 
1 foe) 
P(x-S)= J, evttels)ds (390) 
2 0 
where 
1 fs a dt 
=— Se — 391 
ls) ai V1-—e-§V1 —e-&-2) een 
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Setting e~* = u in (391) we obtain 


1 1 1 du 
J VE aa Ve me 
where 
v=e§, 
Then with the substitution 
l-u=(1-v)t 

(392) becomes 

g(s) = — a (393) 


i VEV hoe 


Identifying ¢(s) with the hypergeometric function which has the 
representation, 


F \(a,b;c;z) = ai WE) ; to-1(1 — t)e-9-1(1 — tz)-4dt 
amen" Tb)T(e — b) Jo 
(Rec >Reb>O) (894) 
we see that 
1 1 
ols) = oF G aoe). (395) 
Thus 
1 T(x) eo 1 1 
2 SSS —sx Sepp wag Ks 
P (2 ) 7 f, e Fi (5.5541 e ) ds 
T? (« + 5) 
2 
(396) 
and 
Jon 1 ot — —s(n+1) 
u(n) =~ Yo) x(k) = — ~ —~——— ¢-8/2s)ds 
T k=0 0 l-—e7s 
ee a fe-8/2 — e—s(nt3/2)} Freee gs) =| a 
1—e7 S 
© p—s/2 — p—s(n+3/2) 
i f i icles SAB 
x J0 Ss 


1 oc 
= — log (2n + 3) +o f e—s/2 [al ds 
w wT J0 LSet s 


- 1 f e—s(n+3/2) eae — ,| ds. (397) 
. 8 


T l-e 
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From (395) and the series 


Pe b)p 2 
oF y(a,b;c;z) = > {eee (398) 
k=0 (cc), R! 
it is clear that 
gs) = ¢(0)=1, s>0 (399) 
and hence that 
1 
ON —~=>0. (400) 
l-e- s§ 
Thus from (397) we have 


ie Ise Qn+3)+— f" en8 [eo - 4) ds (401) 
T T JO0 l-—e-s gs 


and in fact 


lim {(n) - ~ log (2n + 3)| = = ss ~ f° e7s/2 | o - ;| ds. (402) 


n> 1 — e278 


Now we can evaluate the integral in (402) by an indirect route. 
We have 


12 m((L/2)n(/2)e 
u(n) i er bay =2 | kik! a(k + 5 
Ln 1 
al ly a i} _ 


From the estimate (384) we see that the sum on the right converges 
asn — ~3i.e. 


Now we can write 


= 


Az wae he 
Az- TH) ~5 | = lim pe eee 


[= (1/2)e(1/2)e 4, x* | 
k+1 x1 





a 1 
= s=s1;x)-—1 
tim [Fs (igs hi#) ~ low 


+ a (405) 
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Then using (394) we have 





: 1 1 1 
azA = lim a= ae | ae 
x>-1J0 (Vt(l—-—t)(l—xt) 1-—<xt 
1 dt a -1| 





oe 0 1—xt ae 
tim (mala tim La ves 
1 V1 —xt 
=, = Him aera -1} 


= 2log2+2log2. (406) 


Some care is required in evaluating the last limit. An alternative way 
of obtaining the result is worth noting, as it makes use of an interesting 
series obtained from (396). A change of variables gives 


_ x) f (1 — t)*-15F, Gs 5) It) at 


nee) 


= fae [fe LMA a a 


Nile 


3 
a + 2(? | (407) 
7 x(x +1)(x+2)) 2 a 


eo een 


Then setting 


T2 («+5) 
_\ 4 


T(x + 1) 


(1/2), )? : G+ 2 2 


Pee Ve 


r S) T(k +1) 


F(x) = 


F(R) 


Sie 


we have 





1 fe 
- (+ 7 7 7 se ze ae ve 


which is an interesting formula. In particular (F(0),F(1),F(2),...) is 
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an eigenvector of a certain infinite matrix. Also an interesting series for 
a is obtained by setting x = n + 1/2 (large n) in (408). Returning to (407) 
and recalling az = (1/2);/k! we have 

xT2(x) 2 T2(x + 1) = 4igp at 2! 


r? (x +5) x? (x +5) eA (x + D(& +2) 





agt+... 


(409). 


Then using the series 


1 1 
= f, (b= t)"ldp = §. (1—é)*(l+eite2+...)dt 
x 0 0 


1 1 2! 











= 2 ee >0 
eo (ee Dero) GAD Poe 3) on) 
we may write 
1) r%e+1) rq) 
m+) Q) 
2 2 
1 1 9! 1 
wth fatty + lat-2 
eat | GED So) oe 
k! 1 
+... Hae —-— lt... (4 
G41); [ak | oo) 
Since 
1 
J <ai< 


we may let x — 0 with the result 


(411) 





and since a2 = 1, this sum is the same as the sum on the right in (404). 
Hence the limit in (404) is 


1” 1 4 
it ey |S oe 412 
{u(n) ror al meee we) 


no 
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and since 


lim | 


n-7o 


1 
b — log n| = C = 0.577215... (Euler’s constant) (413) 
we have from (412), (413), and (402), 


lim {(n) - = log (2n + 3)| == few 


no 


eel 4 


1l-—e- = ¢s 


e log 2 + 2 (414) 
T T 


Then from (401) and (414) we have 
C 
p(n) < Z log (2n + 3) + 2 log 2+— (415) 
vis “ie Tv 
and by the same argument used in establishing (388), 
1 3 C 
(A) <— log (2A +3) +—-log2+—, A>O. (416) 
Ce ry 7 


We find from (397) and (414) that 


mu(n) ~ log (2n + 3) +3 log 2+C 
3 43 7 


2(2n+8) 48(2n+3)2 16(2n + 3)3 
+ O(n-4). (417) 


For comparing the estimates (389) and (415) we have 
1 4 
1 tee logs = 1.0915720476 (418) 
Tv 


1 
= {4 log 2 + C} = 1.066275853 (419) 
vis 


and for use in (415) and (417) 


2.656657207 


ve 


1 
—{3 log 2+ C} = 
TT 

= 8456402533 (420) 
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In the following tabulations the estimates (388) and (415) and the as- 
ymptotic formula (417) are compared with the true value of n(n). 


n p(n) Asymptotic formula Error 
1 1.25 1.249927097 7.29 (—5) 
2 1.890625 1.890607982 1.70 (—5) 
3 1.48828125 1.488275479 5.77 (—6) 
4 1.563049316 1.563046870 2.45 (—6) 
5 1.623611450 1.623610248 1.20 (-—6) 
6 1.674500465 1.674499809 6.56 (—7) 
7 1.718379259 1.718378872 3.87 (—7) 
8 1.756944605 1.756944363 2.42 (-—7) 
9 1.791343941 1.791343782 1.59 (—7) 
10 1.822389342 1.822389234 1.08 (—7) 
n Upper bound (388) Upper bound (415) 
1 1.269703286 1.857940252 
2 1.413574619 1.465042692 
3 1.512299999 1.545038559 
4  1.587544884 1.608914025 
5 1.648359655 1.662088992 
6 1.699398305 1.707639405 
is 1.743372924 1.747480071 
8 1.782003284 1.782884290 
9 1.816448738 1.814741844 
10 1.847528028 1.843699061 
APPENDIX C 
Estimates for M, 
kr 
Mn n+l on oe 2(n + 1) ey) 
k odd 


In order to express the sum as an integral we note first that 


where 


558 


d é 
ma cot 7x = — log sin zx 
dx 


Vv 


= oe 8 Pa rd —2) 
= y(1— x) — (x) 





2G) 5 A es 
wT f | er | at, x >0. 
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(422) 


(423) 


So 


1 dt 
x cot rx = §, fge-1 — p-2y (424) 
0 Lt 


oO eu 
= f, jee yee) dy. Caer <i, 
0 Le 4 
Then for 0 < 6 < 1/y, where » is an odd integer, 


m0 3 cotk7ré= 6. 
k=l 0 
k odd 


foo} (e~48 = e—(vt2)u8) 


(ev = e(v+2)u8) eu 
"eg \geese 


or 


70 > cot k70 
k=1 
k odd 
_ ie {ex (e-t — e-+2)t) " (e-t — oy enat - 
0 L=e=2 Least Le=e! 
oe-t — er 2)t co {1 = e— Wt )t}2 e (v—a)t : 
= —————- dt - f, —————— dt 426 
J, 1—e-2! 0 1-—e72# 1 1-—e-at Be8) 


where a = 1/0. For n odd we take vy = n,a = 2(n + 1). Then 








4 © p—t — p—(nt2)t 4 © {J — e—(nt1)t}2 e—(nt2)t 
ged (ore enim do mening emtnten 
JO 1—e7 rJO0 Peertt Tee 2OrVe 
4 ree t— Cs ise. di 
os ——_—__—_-—_—__ qt — 
= 1—e72 a). t Ite 
sinh 
n+l 


nodd (427) 


The first integral is just the sum of the reciprocals of the odd integers 
from 1 through n. We have 


f, eae = log (n + 2). (428) 
So 
2 f, E a dt = log (n + 2) + fj, pot Fares _ ,| dt 
= §, © en (nt2yt fae = ;| dt. (429) 
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We have 


2 Z 1 1 2 I. 
a dt= { | + | 
af : es a o tl-—u? logu o 
1 
- (Rte Na 
o tl-u 1+t+u _ logu 


1 1 
=f ae +f | ee ee | du 
0 l+u 0 ll-—u_ logu 




















= log 2 — ¥(1) 
=log2+¥ (430) 
where 
y = 577215... (Euler’s constant). (431) 
Thus 


Mee Seedy Geet aw 
rg a 


-2 grea ( 2 Ng 
x JO l-—e72t ¢ 


_ 2 f° 1l—e7é dt 
m(n+1) Jo t l1ltet 


sinh ——— 
n+1 


(n odd) (432) 


We may write 


© 9 1 co 1 e~é 
e 7 (n+2)t Fees Par | dt = f, e~(nt+i)t | = oa dt 
J, l-e-# tft 0 sinh t t 
_ f e (nti | 1 a ‘| dtt+ f. ea (nt+Dt (1 iss e-t) dt 
0 sinht t 0 t 





























1 os 1 +1 +2 
- f, e-t J—-~—_ __2** | a4 log? **. (438) 
n+1 Jo : t t n+1 
sinh 
n+1 
Also 
°j]—e7 dt 
n+1 1+eé 
sinh 
n+1 
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el1—e! fe cw 2 ed 


























— dt 
—n+1 1+e! t 
sinh 
n+1 
foe) 1 — —t 
i f, Set 80) 
0 ltet ¢t 
and (Ref. 9, p. 327) 
-“j{-—e-tdt 7 
———— = log—- 435 
Si ieee See 2) 
Thus we find from (432)-—(435), 
2 
M,, = — log (n + 1) 4 (log +7) 
TT Tv Tv 
4 “intl 1 dt 
ee J, pane ae 
mn +1) Jo t : t 1+et 
sinh 
n+1 
nodd. (436) 


Clearly the integral in (436) is positive. In order to obtain the asymptotic 
‘series for M,, n odd, we use the generating function for the Bernoulli 
polynomials (Ref. 1, formula 23.1.1) 








ee Rey | Eee (437) 
to obtain 
t a 1\ tk 
= > B, ‘Gs \t| <20 (438) 
_, t k=0 2/ k!} 
2sinh > 


where (Ref. 1, formula 23.1.21) 





By (5) =—(1—2!-*)B,, By, = B,(0). (439) 
Thus 
Jee Ota ee (440) 
sinh t¢ k=0 ki 
We have (Ref. 1, formula 23.1.19) 
Bor+1 = 0, k =1,2,... (441) 


ANALYTIC MODULATION SYSTEMS 561 


and (Ref. 1, Table 23.2) 























1 1 1 
Bo=1, Bi=—-=, Bo=-, Bs=-— 
’ Bie ei r= ae ae BG 
qi 1 5 691 
Bg =— ’ Bg=-— ’ Bi =, ’ By2= ’ 
ae | ae | ena) 1), 
7 
Big = 6 . (442) 
So 
1 1 is Bopt2*-} 
= — = > (22% — 9) —_ 443 
t sinht P» ( ) (2k)! a) 
aero. ot to+... |t| <2. 
ee 360 15120 
From Ref. 9, p. 325, we have 
oo y2k-1 F ‘ ek : 
= (242 —2 ——., itl youre 444 
So ani® ( Boel “gy Sa 
Also, (Ref. 1, formula 23.1.18) 
7 7 22n)! 2 1 D 
Bon = Bon(0) = (-I"41 Oo, LB... (445) 
0) 
Bop = (—1)**3|Bop|. (446) 
Thus we obtain the asymptotic series 
n+l a 1 dt 
n+1 Jo t ; t 1+e! 
sinh 
n+l 
o (—1)k+1(22k — 2)2B2, a \2k 
bei 4k(2k)! € + i) 








2 4 
~ =, ( ul S aa TA ul ) +... (447) 
72\n+1/ 43200 \n +1 


and hence 
v 49 x3 
18(n + 1)2 10800(n + 1)4 
(eh eS ZTE us : i 
T k(2k)! n+1 


M, ~ Slog (n +1) += = (log = +) + 








Teese (n odd). (448) 
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Now we would like to show that 
M, > = log (n+ 1) fe = (log = +7) 


Tv 
18(n + 1)? 
w 4973 
18(n + 1)2 10800(n + 1)4 
etc., (n odd) (449) 


i.e., that the error in truncating the asymptotic series has the same sign 
as the next term of the series. To do this, we show that for t > 0 


<= log (iba) = (log = +7) + 


> = log (n+1)+= = (log = +7) + 














11 
t  sinht 
t 
< = 
6 
6 360 
< t = TT t3 31 t5 
6 360 15120 
etc. (450) 
We have (Ref. 9, p. 23) 
0 1 
=14+2t2 © (-1)* ———— 
sinh t Py (“1) t2 + kn? (451) 
or 
1 t 2 ak 
—i1— =2 —1)k+1 ——__., 452 
t? sinh | x A) t2 + k272 On 
Now consider the polynomial 
2n+2 
Uae) 
ik 
Po, (t;k) = 2————. 
an(t;k) 12 2g? (453) 
We have 
Pon (t) = ¥ (-1)**1Pop (tsk) (454) 
k=1 





= 2(- [)ntlpente = (=0ee 


1 t 
=—J1— Se 
t2 | sinh t =1 (kw)2"*2(t2 + kx?) 
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or 





iL 
ap | ~ Pant) 
(- 1)k+1 
1 (kaw)22+2(¢2 + kx 2)" 


It follows that P2, (t) is a polynomial of degree 2n which agrees with the 
first (2n + 1) terms of the Taylor series of t~?{1 — t/sinh t}. The sign of 
the difference in (455) is (—1)”"*!, as the sum is clearly positive. Then 
(450), and hence (449), follows from (455). 

For n an even integer, the asymptotic expansion of M,, is not so readily 
obtained. In this case we set v = (n — 1) in (426) and keep a = 2(n + 1). 
Thus 


4 © e7t — e—(ntl)t 
M == { Eas SSM 
"a Jo 1—-—e-2 


(L—en")? — e7(+3t 
= 1 —e72t jo ent0ene t? neven. (456) 


= 2(- [)rtlzg2nt2 2 (455) 


We have 


foe) et — e—(ntlt 
2 f ee ag ee OR (nee) 
0 l—-e 


« 2 1 
+ —t — e-(ntt Fase: _ | dt. (457 
[tert eG eee dh. 5) 
Now we would like to express the second integral in (456) as an asymp- 
totic series in (n + 1)~!. For convenience we set e~* = x. Then 


(1 -—e7nt)2 (1 — x")? {l—x"tl—x(1 — x)}? 


1 — e~2(nt Ut ~ 7 — x 2nt2 (1—x"t1)(1 + x71) 


7 r _ xn a F eee ee — a) 


Ltx7tl Y4+unrtl 
a eee eee 2) Le x)  x2"(1 —x)? 
Tt ertl poyrtl zpayntl — y—x2nt2 ° 
Therefore 
co (1 = eanty2 e—(nt+3)t oj— e—(ntlt e~(nt3)t 
eant 1—e7é —(n+3)t 
a9 (2 Oe ea 
Lae nD) ise 
5 ft ee — e-t)2 e-(nt3)t 4 
0 


L—e-2ntDt = 1 — e~2t 
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of ea (nt+t dt 
= f, Ltelmtit e% 1 
0 Llter~™tbDt et +1 
© -38(ntl)t 1 —e-t 
A ear taecrdt. (458) 


We can combine a part of the second integral with the last by noting 
that 





=e 








1 1 1 t 
=—--—-tanh-. 
eb+1 2 2 2 
Then 
© p—2(n+1)t dt © g—3s3(nt+1)t t 
-2 { L+er-(@tDt apts), Ta onaintie tanh, at 


1 t 
— d ——— tanh ————- dt. 
(n+t1)Jo 1t+e-! (n+1)Jo 1-—e72 2(n + 1) 


By making use of (430) and (435) we obtain 











M, = = login + 1) += = (log = +7) 


== few os dt 
1 JO l—-e-% ¢ 











ees 1+e! t 2t 
exp an, cao 4 
n+l 
© p~2t 
ee c dt 
r(n+1)Jo 1t+e! 


t 
= —— tanh ———— dt, neven. (459 
a(n+1)Jo 1-—e7 a 2(n + 1) ve) 


From (437), (441), and (442) we have 








9 1 oo , Bat? oo Bopt 2k-1 
-= =-l1+ 92k ———___ ; ti< 
‘-1 ¢t ae RI Pa (2k)! a 
(460) 
2 = Bopt 
1+ Q2k ; t}< 461 
1- e—2t 2» (2k)! | | w ( ) 
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i.e., 


2 1 2 1 Bopt 2* 
———_ ---l= —-4+1= gor Bort 462 
l-—e72 ¢ e2?—] ft S (2k)! age?) 





Thus 


ojJ—e-é +1 2 
f ie ae 7 
a 1+eé t 9 


t 
e —_—_— — 
7 Ge ) 


== ("erwin fo eo, dt 
rT J0 l-—e-2 ¢ 


2 “l-—e-§in4+1 2 
oe i 2 rs cree 
mn+1)Jo 1+te! t ot 

















2 ed 2 n+l 
oa het Tee 2 ——————$—<——— ie —i\dt 
aay —2t i 
l-e 
= e-t dt 
eee orn | 


l-e n+1 2 
et —_ ——__- - | d 
=a I ote | t 2 


2 4 o et 
m(n+1)Jo0 1lteét 














dt. 
Then from the above and (459) we have 


Mn == log (n +1) += = (log = +7) 


Fi 4 f- n+1_ 2 4 dt 
a(n +1) Jo t Ot ; 1lt+eé 


exp (———) - 
. 3 ae 7) 
2 o et 


t 
— ———_ ——— tanh ———— dt 
r(n+1)Jo0 1-—e7é a A(n + 1) 








neven (463) 
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which is a form suitable for asymptotic expansion. We have 


2k-1 
tanh x = 92k(92k —1)B 464 
anh x = oy ( ) * (Qh)! (464) 


ie dt af ese Aiea 
0 1+e! 0 
1 1 1 
= pl ~ = 
Al (1 ght + gkti at) 


= k! (1 = 5) tk +1) (465) 














where 
Gis Ss wok (466) 
R=1R"™ 
Also 


ror) et for) 
ak thdt= tk(e-'+ e224 e-3. .) dt 
0 l-e-§ 0 


1 1 
= k! Cea 


=k! ¢(k+1). (467) 











From (460) and (465) we have 


—s Sf n+1 b.. 2 aj dt 
are +1) Ot 1+et 
exp ( 
a 
_ 4 ° Bo, 2?" (2k = 1)! 
wk=1 (2k)! (n + 1)2% Ge 








92k— =) §(2k) 


2¢ Bo, _ §(2k) 
y— O28 9) 2 ee 
oe mene (68) 
and from (467) and (464) 
2 © e-t t 
ate ae yy = fal 
a(n + 1) oi1l- ew7t tan A(n + 1) dt 
aS Bor 
a deat Dh (92k pa _p! 
a ( 1) ao (2k — 1)! (2k) 
Bop §(2R) | 


~— =X (g— p12) St 


T R=1 k (n+ 1)2%- mee 
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From (445) we have 


(2k)! 
Bop = (—1)*+12 (amon SR). (470) 


Adding (468) and (469), using (470), we obtain 
2 4 
M, ire (n+1)+- (log + v) 
T T Tv 


Sa eee oe ee ce 
36(n+1)2 21600(n+1)4 9 
B3, qk 


k(2k)! (n + 1)?’ eee 





Eg i (—1)#(24" — 2) neven. (471) 
vis 
In the same manner as before, we can establish that 


2 
M, <= log (n + 1) += (log< +7) 
WT Tv via 


D 2/7 4 a” 
>= log (n +1 (1 =+ Vee as 
BRACES et) ar eas 
2 2/7 4 a 127 
<-log(n +1 += (Io a err ramar aera T 
ete (08 1) ne ae cu GD 
etc., forneven. (472) 


In this case we have two functions to consider in the polynomial ap- 
proximation problem. First we note that 





om] + 1=cothx (473) 


and 





coth x ¢ log sinh ] F 1+ x? 
=> xr 
dx 2 dx oe {= pus ( a) 








1 © 
= - + 2x > 24 hegd . (474) 
Clearly, for x > 0, 
See <0 (475) 
ex—] x 
and 
poy tiscoths Zam 5 Ge (476) 
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Now we have 


coth x — Z 
A (477) 
x kai x2 + k272° 
Defining as before 
1 ( X \ 2nt+2 
=a) 
Pon (x;k) = aera k =1,2,3,... (478) 
and then 
Qon(x) = 2 Y Pon ik) 
1 
coth x —— er 5 
x X\ 2nt+2 & 
= — (-1)"*1 (=) —___“______ (479 
x (-1) wv Py h2nt2( x2 + k27) ( ) 
we have 
1 
coth x — : ae 5 
Se, = (—1)nt1 =) ee 
RRSP A) ae a 
(480) 


So Qon(x) is a polynomial of degree 2n which agrees with the first (2n 
+ 1) terms of the Taylor series of x~{{coth x — x—}} and the sign of the 
difference is (—1)”"*!. 

For the second function we have 


d @ 1 
tanh x = —— log cosh x: = 2x >| ———_————,, (481) 
x2 + (2k — 1)? Z 


Now we define 


1 ( 2x 2n+2 
Nik -1 ) 
Pon(x3k) = SE k =1,2,... (482) 


2 
x2 + (2k - a 
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and 


Gon(x) = 2 2, Pan(xik) 
2n+2 
a: tanh x ~ (141 (*) = 1 —. 
x =1 “is 
(Qk — 1)2n42 [x? + (2k — 1)? =| 


(483) 
Then the alternating sign of the error 


tanh x 


— dan (x ) | 
follows; i.e., 


tanh x <x 
3 

ge 
3 


ye. 22 
er ea sare res 
3 615 


etc., forx >0. (484) 
The inequalities (472) then follow from (463), (476), (480), and 


(484). 
Then for n even or odd we certainly have 





4 fi v 
M,, > — | +1)+—-(log-+y)-=— 485 
Ree a = (oe 7) ati 
or, giving away a little, 
M, > log as a = (log = + Y); aaa (486) 


Obviously (486) is true for n odd since for n odd we may replace n in 
(486) by (n + 1), and for n even we consider 





log (n + 1) - = =logn+t+lo 1+2)-2_* 
: ae fe" & ( n/ 72(n+1)2° 
Now 
x eo x x2 
log (145%) = raw cat | (= Hapsg = Hw: 
08 ( =) o lt+t 0 ( ) : 2 - 
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So 


1 2 1 
log (1 ) pl a BS forn = 2. 
n 72(n+1)2 n 2n? nn? 
Thus (486) is valid for n = 1. 
We note that the actual computation of M,, is simplified, particularly 
for n odd, by making use of the identity 








1 + cos 26 
cot. = 
sin 20 
Then for 6 = z/2(n + 1), 
2 2 1+ cos 2k6 
M, = tké=—— —_———_, 
n+l ae os nt+i1 2 sin 2k60 (487) 
k odd k odd 
For n odd, k = 1,3,5,...,n, 
2Q2hkr ee 2n+1—-—k)x 
os ———— = =cos 
= (n+ 1) 2(n + 1) 
. 2khr . An+1—k)r 
sin ————— = sin —————_ 
2(n + 1) 2(n + 1) 
So 
2 1 
M,= = SS odd: (488) 
n+ lig<k<n . 
k odd sin 
n+1 
We find 
M, _ 
2 2W/3 
Mos = —= = — = 1.1547005. . 
e" V3 3 








ya (ae | 2 4 
M4 ==|—— + = [1 + cos 7/5] 
. . Qr . Qr 
sin- sin— 5 sin — 
5 5 5 
= 1.5216904... 
M;= ° = 1.6666666. . . 
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For use in the asymptotic formulas we have 


= = 0.636619772367 ... 


Tv 


log ae 0.241564475270... 
T 
y = 0.577215664901 ... 


4 
log = + y = 0.818780140172... 
wT 
27 4 
2 (log = + 1) = 0.521251626... 
mie “is 


* = 0,174532925... 
18 


ths = 0.610865238... 
36 

4973 

10800 

127x3 

21600 


The inequalities (449) and (472) give 





= 0.140676625... 





= 0.182305423... 


M, > 0.96252282... 
< 1.00615605... 
> 0.99736376 ... 


Moe < 1.2206499... 
> 1.1527760... 
< 1.15502669... 


M3 > 1.40379402... 
< 1.41470233... 
> 1.41415281... 


My < 1.54585162... 
> 1.52141701... 
_ <1.52170870 


M; > 1.66192113... 
< 1.66676926... 
> 1.66666071 ... 
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APPENDIX D 
Proof of Theorem 5 


We are given 
w(t) = e2) (489) 


where the Fourier transform of z(t) vanishes outside [0,Q]. Also the 
Fourier transform of w(t) vanishes over (a,b) where 0 < a < b and 


b-a>Q (490) 


and we wish to show that w(t) = constant. 
We may write 


w(t) = g(t) + h(t) (491) 


where the Fourier transform of g(t) vanishes outside [0,a] and the 
Fourier transform of h(t) vanishes over (—~,b). We have 


w'(t) = 2’(t)e? = 2’(t){g(t) + h(t)} (492) 
= g(t) th'(t). 


Now the Fourier transform of z’(t) vanishes outside [0,0] and the Fourier 
transform of g’(t) vanishes outside [0,a]. By Corollary 2 of Theorem 2 
the Fourier transform of z’(t)g(t) vanishes outside [0,a + Q]. The Fourier 
transform of h’(t) vanishes over (—~,b) and by Corollary 1 of Theorem 
1 the Fourier transform of z’(t)h(t) also vanishes over (—~,b). Thus if 
Ka,» is any kernel of L, satisfying 


f ” Kap(tie#tdt =1, 0<w<a (493) 
= 0, wb 
we have 
f. w'(s)Kap(t — s)ds = 2’(t)g(t) = g’(t). (494) 


Now 2(¢) and g(t) are the restrictions to the real line of entire functions 
of exponential type; so 


8'(7) 


= 2’(r). 495 
a) 2'(r) (495) 


Hence g(r) is zero free in the entire plane and is, therefore, of the form 
(Theorem 2.7.1, Ref. 6) 

g(r) = Aeir, (496) 
Hence from (495) and (496), 2’(7) = 1A, and since z(7) is bounded on the 
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real axis it follows that \ = 0; i.e., z(t) = constant and hence 


w(t) = constant. (497) 
APPENDIX E 
Proof of Theorem 6 
We are given 
w(t) = log {z(t)} (498) 


where the Fourier transform of z(t) vanishes outside [0,0] and for some 
positive 


|z(t+iu)]=>« foru=0 
—-0<t<o (499) 


Thus w(7) is bounded and analytic in the uhp; so by Theorem 1, the 
Fourier transform of w(t) vanishes over (—~,0). Also we are given that 
the Fourier transform of w(t) vanishes over (a,b) where 0 < a < 6 
and 


b-a>Q (500) 


and wish to show that z(t) = constant. 
We proceed as in the proof of Theorem 5 and write 


w(t) = g(t) + h(t) (501) 


where the Fourier transform of g(t) vanishes outside [0,a] and the 
Fourier transform of h(t) vanishes over (—~,b). We have 


yee” ayant) (502) 
z(t) 
or 
2/(t) = 2(t)g’(t) + z(t)h’(t). (503) 


Now the Fourier transform of h’(t) vanishes over (—~,b) so by Corollary 
1 of Theorem 1 the Fourier transform of z(t)h’(t) also vanishes over 
(—~,b). By Corellary 2 of Theorem 2, the Fourier transform of z(t)g’(t) 
vanishes outside [0,a + Q]. Since a + Q < b we conclude as in the proof 
of Theorem 5 that 
2'(t 
z(t) = 2(t)g'(), or g’(t) == (504) 
z(t) 
and hence that z(t) = Ae‘ and since g(t) is bounded, \ = 0. There- 
fore 


z(t) = constant. (505) 
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APPENDIX F 


Lower Bound on the Degree of Certain Polynomials 
Suppose a polynomial of degree »v is of the form 
Pi(z) = 1+ anz”" + ang z7t1 +... a,2” (506) 
where |a,| > 1, and P,(z) is zero free for |z| < 1. Then 
py = 2n. (507) 
To prove this assertion we assume 
p< 2n. (508) 
Then assuming that P,(z) is zero free for |z| < 1, the function 


G+ G12 +... Gz? $2” 


Ltanz "+ angy2z"t1 +... tages 1 no) 
is analytic for |z| < 1 and 
[f(e®)| =1, —1r< O07. | (510) 
Then 
f(z) = > beets lz| <1 (511) 
where 
by = = f. : flei)e-ikogg, (512) 
Thus from (510) and (512), 
[b.| <1. (513) 
However, with the assumption pv < 2n we see from (509) that 
bh =a@,-» forO<k<yv-—n<n. (514) 
In particular 
by -4 = Oy: (515) 


But |a,| > 1, so (515) contradicts (513) and therefore (508) is false. 
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Inductive Post Arrays in Rectangular Waveguide 


By T. A. ABELE 
(Manuscript received July 15, 1977) 


Previous attempts, based on mode-matching techniques, to obtain 
precise data for the equivalent circuit of inductive post arrays in rec- 
tangular waveguide have consistently failed due to convergence 
problems. A different formulation is presented for symmetrical post 
arrays, which is shown to be free from this defect. 


I. INTRODUCTION 


Waveguide band-pass structures employing cascades of inductive 
posts have been built for many years. They usually contain a symmetrical 
arrangement of posts in each cross-section, mostly one, two or three 
posts. The latter arrangement, for instance, is a favorite for \/4-coupled 
filters, since it strongly reduces higher order mode interaction. This al- 
lows \/4 spacings to be used instead of the 3\/4 spacings required for 
single post filters, thus leading to shorter filters. 

In the past all of these structures had to be designed on the basis of 
measured data for the equivalent circuit of the cross-sectional post ar- 
rangement, because the available theoretical calculations!?* are not 
sufficiently accurate. The obvious problem with measured data is, of 
course, that two errors are introduced, whose magnitudes are only poorly 
known: dimensional tolerances of the sample to be measured and errors 
in the measurement itself. 

Previous attempts to obtain theoretical data based on mode-matching 
techniques have consistently failed due to the convergence problem 
typically associated with taking a finite number of unknowns out of two 
sets of infinitely many unknowns. This paper presents a formulation 
which leads to only one set of infinitely many unknowns in the case of 
single or double posts. It may thus be expected that, when a finite 
number of these is taken, no convergence problem will be encountered. 
One may also speculate that this will continue to be true for arrays in- 
volving three or more posts, although in these cases more than one set 
of infinitely many unknowns is encountered again. 


577 





Fig. 1—Post array. 


ll. CONFIGURATION 


We wish to determine the equivalent circuit of the array in Fig. 1 in 
the plane z = 0. 

The posts are circular. They are numbered consecutively from » = —M 
to » = M with p = 0 designating the center post. The array is symmetrical 
with respect to the plane z = 0 and the plane x = 0. The center post may 
or may not be present. Each post u has a diameter d, and a coordinate 
x = p, of its axis. Only dominant (TE,9) mode propagation is assumed. 
The surfaces shall be perfectly conducting. 

The electric field will be calculated as the superposition of two fields; 
the field which exists without the posts, the unperturbed field, and the 
field generated by the surface currents on the posts, the perturbation 
field. The surface currents, or rather the coefficients of their Fourier 
series, are treated as unknowns, which are subsequently determined in 
such a way that the tangential electric field vanishes on the surface of 
the posts. As usual, only two special cases of excitation are studied, even 
and odd, since this suffices to determine the equivalent circuit. 


lil. UNPERTURBED FIELD 


We set 

Ey even = (e/8e? + e—J8s?) cos — (1a) 

a 
Ey oad = (e488? — e—J8s7) cos as (1b) 

a 

with 
2 

By = Iv p-= (2) 
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Fig. 2—Post surface. 


where (dominant mode assumption) 
=—>- (3) 
a 


d is the wavelength in our medium. Obviously the fields in Eqs. (1) fulfill 
the boundary conditions everywhere except on the post surfaces. 

For later use we wish to develop these fields into Fourier series on the 
surface of a post v located at p, and of diameter d,. 

From Fig. 2 we see that the post surface has the coordinates 


» = py~ 5d, 008 p (4a) 
1 ; 
2 = 5% sin (4b) 


Introduction of these expressions into Eqs. (1) and use of the well-known 
expansion of e/? sn? (Ref. 5, p. 22), results in 


Ey even — 3 In G Bd, ) e/ne 


n=—o 


x | cos (Fa, _ ngo) + (—1)” cos E Dy + ndo) | (5a) 


oS 1 . 
Ey oda= DX dn G Bd, ) esne 


x | cos (Fp. — no) — (~1)" cos (= Dp, + noo) | (5b) 
where 


cito =~ (8, +57) (6) 
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IV. PERTURBATION FIELD 


To determine the field generated by the current distributions on the 
posts we observe first that all of these currents are independent of y and 
in the direction of y. Secondly, the effect of the broad waveguide walls 
can be replaced, making use of the common imaging technique, by as- 
suming that all posts are infinitely long in both y directions, and, again, 
have current distributions which are independent of y and in the di- 
rection of y. Thirdly, by employing the same imaging technique once 
more, we can replace the effect of the narrow walls by periodically re- 
peating the array of infinitely long posts in both x directions with post 
locations and current distributions, which are consecutive mirror images 
of each other. To determine the perturbation field, we can then simply 
sum up the fields generated by these infinitely many and infinitely long 
posts, without having to worry about the boundary conditions on the 
waveguide walls, since they are automatically fulfilled. 

From basic electromagnetic theory we get for the electric field gen- 
erated by a current filament stretching in y direction from —~ to =, lo- 
cated at xo, Zo, and of strength I,, only the following component in y- 
direction 

Jou 2 e-JBl V(x=x0)2+yo2+(2—20)2| 
Ey=—"T dy ff. Sr ayo 
us —|V(x — x0)? + yo? + (z — 20)? | 


= — 24 HMB] Vea aE Se ) 


The latter transformation may be found in Ref. 3, p. 27. u is the perme- 
ability of the medium. 

Making use of the symmetry of our structure and summing over all 
currents on all post surfaces we obtain 


M Qr © 
Fy=-“— ¥ 1,0) XS (-1)kHy® 
4 p=—M 0 k=—o© 


(ov (2 -=4,siny) + (« = pa tha +4, 008¥)']) dy (8) 


Figure 3 explains the quantities /,(Y), d,, p, and the coordinates 
used. 

With this expression for the perturbation field we will do two things. 
First we will determine its value at a large distance to obtain expressions 
for the elements of the dominant-mode equivalent circuit. Secondly, we 
will evaluate it on the post surfaces in order to be able to come up with 
an expression for the boundary condition for the tangential electric field 
there. 
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Pu 
ly=ly (y)dy 


Fig. 3—Post. 


V. PERTURBATION FIELD AT A LARGE DISTANCE 
If we write 


1 
B=- 
a 





1 
Z — 5 d,siny| 


C= 


Ql 


1 
(« — Put 5 dy cos 





A= Ba 
in Kq. (8), we can apply Eq. (34) from Appendix A to Eq. (8), which re- 


sults in 
M Qa 
a a 6 TC) 
a= Ts 2 
, exp |\/a2- SR eo 
[= 
= 2k — 1)27? 
R=1 [Vee 
x cos [IF ( — py + 5 dy cos ¥) | 
2k — 1)2x? 
p— [VSP - 
io SC eT 
k=l41 (2k — 1)?7? 
Vane 


X cos | («pat 5 dy cosy) |} ay (9) 








1 
2 — 5 4,siny| 








1 
2 -54,sin y| 








with 
1 — 21 + 
lel > Sd (21 Da 2 ge 2k 1)x 
a 


a 
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The second of the two sums over y obviously represents the evanescent 
modes in the rectangular waveguide and, therefore, vanishes for large 
|z|. In accordance with our assumptions we have 


hae (10) 


which means that / = 1 in Eq. (9). We therefore find from Eq. (9) for 





Zz 
—»>1 
a 
fi var 
1 
X cos " (x = Dis +5 du cos ¥) | dy 
a 
= WEL ae ae I a) [e ~JBgz tia (x— Pullaess led, sin (Y+¢o) 
~ 26,0 wn 


4 e-iBg2—Jm(x—py)/agj5bd, sin (Y-e0)] dy (11) 


where ¢o is again defined by Eq. (6). Using once more the expansion al- 
ready used in Eqs. (5) we obtain 


wh _.., M = 1 
mar Ut cal Doge? 2 Im (5 Bay) 


g u=—-M m=—-& 
T Qa : 
X cos E (=p) + moo | f, T,(per’dy (12) 
a 0 


The inversion of the order of integration and summation employed here 
presents no difficulty. 

Physical considerations tell us that the currents I,,(W) can be developed 
into a Fourier series. We write in the usual manner 


I,(p) = y Camel e™ (13a) 


ma=—-a 


where, of course, 
1 2a . 
mao, LaWeninedy (13b) 
27 J0 


Eq. (18b), when combined with Eq. (12), results in 





_ _27wp aie M ba 
By= FR enite Fr (Dunn (;-64,.) 


gQ w=—M m=—-& 


X cos E =p) = moo | (14) 
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Fig. 4—Equivalent circuit. 


For x = 0 (center of the guide) this results in 


WL 





Qn ; M co 1] 
=—_— —JjBgz —|])m = 
Ey ja Oe (Damn (5 84.) 


ga uw=—-M m=—-0 


X cos (Fp, + moo) (15) 


Combining this result with Eqs. (1) for x = 0 we find for the total 
(unperturbed plus perturbation) field for 7z/a > 1 


2Twpu 





Eyeyen = eJBgz + e JB gz - e ~JBez 


Ba 


Xx s 3 (-1)™ey meysrdm (5-64,.) cos (@z + moo) (16) 


C u,meyenis written here to distinguish between the values of c,,m for even 
and odd excitation. For the equivalent circuit of Fig. 4, which is valid for 
the plane z = 0, we obtain from Kq. (16) the reflection coefficient 


QT we 





poy? = £1 
Ba 


xy (-D) "eumeglm (5 By) cos (= pu + moo) (17) 


u=—M m=—-0 





with 
: 2 
jab 1 
)]B 
Peven — 9 (18a) 
X+—+1 
J iB 
jx —-1 
odd =; 18b 
Podd jX+1 (18b) 


These equations permit the calculation of X and B once the values of 
Cu,mexep are Known. 
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We note, that because of the structural symmetry with respect to 
x=0 


Cai = (Dee (19) 
for » = 0, +1, +2... Also, since we have 
Tysylb) = £1 exer (2 — ) (20) 
it follows that 
Cy—meyn = = = mex (21) 


Eggs. (19) and (21) permit reduction of c,,m for negative values of » and/or 
m to those with positive values. 


VI. PERTURBATION FIELD ON POST SURFACES 


Referring once more to Fig. 2 we get for the perturbation field on the 
surface of a post v from Eqs. (8) and (4) 
| 


es (LW sy 
p=—-M J0 k=—o 


2 
X Ho® (3| ( d, sin ¢ — ; d, sin v) 
2) 1/2 
+ (py— py + ha — 5d, cos 6 +5 d, cos ¥) | ) dy (22) 


We wish to write for this a double Fourier series with ¢ and yw as inde- 
pendent variables. This can be done with the aid of the so-called “ad- 
dition” theorem (Ref. 5, p. 361) if we impose the condition that the posts 
do not penetrate or touch each other or the narrow walls of the wave- 
guide. We obtain 


M 2a 
By=— "Df, Ta) 


«| 2 ee CDM Im (5 Bee) Jn (5 8d.) 
katp,—py>0 


X Hn {B(p» — Pu + ka)} ef rot my) 


+ 3 3 ae n{_ 
=. a a6 G Bd, ) 7 ( Bd) 
oO 


He 1 {8(py — py + ka)} efro+my) (—1)ntk 


+lim > dp (5 Bd,) H® 6 «Bd, eine | dy (23) 
K>1 n=—© 
u = vonly 
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Based on physical reasoning (summing contributions of current fila- 
ments in different order, integrating around each post before summing) 
we now exchange the order of summations and integration in Kq. (23) 
and carry out the integration. This leads to 

ra) M oo 
By=— SO" ¥ tn (Sede | XX Im (5 8dn) Cum 
2 u=—M m=—0© 2 


~ (-1)*H2,(Blp, — p, + Ral) [sgn (py, — p, + ka)|"+™ 


xX 
k 
Pu-~putka 40 


+ lim H® (5 «Bd, Con | (24) 


k—>1 
With the abbreviation 
XS (-1)tH® (|A + kB) | sen (5+ k) | 
Sono 


= fm(A,B) = (-1)™ f-m(A,B) = (-1)"fm(-—A,B) (25) 
we can write this as 


WUT & 1 : 
By=- 9H" § 4, (Lpa,) eins 
y 2 para, (5 Bde) e 


x| ES Im (5 Bn) Cumfm-n (8(Px — Py),Bal 


p=—M m=—0 


+ lim H®) (5 Kd, )evn | (26) 


x1 
If we take this result for the perturbation field, add it to the incident 
field Eqs. (5) and impose the condition E, = 0 on the surface of the post, 
we get (letting x — 1) 


> Jn G Bd,) eine| cos (Fp. = no) + (—1)" cos (Fp. 7 noo) | 


n=—o - 


= 2h Fn (Sb) em] FE Im (5 Ben) 


p=—M TUS == 99 
1 
x Cu,mexen Im=—n {B(Du “= P»), Ba} 1 HY G Bd, ) cana | (27) 


Because of the uniqueness of Fourier series (Ref. 4, p. 186), this results 
in 


cos Ep. _ no) + (-—1)" cos (Fp. + no) 
M 1 
= oHn Re _& Im (5 Bde) Cumssr fnnl8 (Py, ~ Pe)sBal 


+ H?) G Bd.) cana | (28) 
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This equation holds for n = 0,1,2....It expresses the boundary con- 
dition on the surface of post v for even and odd excitation. If applied to 
all posts y = 0, +1, +2...+M, it expresses the boundary condition on 
all posts. However, because of the symmetry involved, only v = 0, 1, 
2...M are needed. As before, Eqs. (19) and (21) permit reduction of cy m 
for negative values of » and/or m to those with positive values. In sum- 
mary we can say that Eq. (28), if applied forn = 0,1,2,...andv=0,1, 
2...M, will allow us to compute all of the unknown coefficients c, ,. In 
turn, Eq. (17) will then allow us to compute the elements of the equiva- 
lent circuit, which means that our problem is solved. 

Appendix A and Appendix B provide expressions suitable for the 
computation of f,(A,B) in Eq. (28). These alternate expressions are 
essential, because the defining series Eq. (25) converges very slowly, as 
the magnitude of H{?’(z) decreases only with |z~1/2| for large z. The 
derivation of these expressions constitutes the most difficult and labo- 
rious part of this analysis. For convenience the results are repeated below 
in the form most appropriate for Eq. (28). From Eqs. (40) and Eq. 
(41), 


4 
fm(8p, Ba) = — tan ¢o cos (ag a) eim($o-/2) 
T a 2 


2d nxp mr 
ae 


df call a a 2 
seat ty] i] [2 +\V(2)-1 I 
or i 
.1*2 (m—n-1)! 
+j- > 
m—2n 
rN 
x hm—2n—1 (cos a) (29a) 
. 1p a 
a sin —— 
a 
fom-1(8.Na, Ba) = 0 (29b) 


fom(BNa, Ba) = = tan bo(—1) N+ mej2m(o-n/2) 


2X 
ae (- 1)N+m 


fe 


1 2(-1)N*+"(m + n — 1)1(22"-1 — 1) Bo, aes 





ees ¥ (29c) 


T n=0 (m —n)!(2n)! 
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From Eq. (34e) and Eq. (34f) 











a 1 
=— j(¢o9—77/2) ‘a 9 FP. 
fo(8p, Ba) ee e/ +77 In (cot a 
a 4 cos “25 
= ——— $$ _ (29d) 
Wey Wer 
N 
fol8Na, Ba) = =! ejtou-eP — (1) + j= (C+ In) (—1) 
cos do 
1 4(—1)N 


TT eee. |v (2-1 ny “=| [224 WV (By -1 nn =a] ] ao 


These expressions are valid if 
2 
2a >> = 


p #0, +a, +2a.. 
m = 1,2,3,... 
N =0, +1, +2... 


The polynomials h,,, (x) are defined in Appendix B, Eq. (42). B,, is the 
nth Bernoullian number and C is Euler’s constant. 


Vil. NUMERICAL RESULTS 


A series of calculations was made to investigate the question of con- 
vergence and to ascertain that the rather involved analysis is error-free. 
To this end the reactances X — 2/B and —X [Kas. (18)] were calculated 
for the cases of single, double, and triple posts with \/a = 1.2, p;/a = 0.25 
and Bdo = Bd; = 0.2 and 0.4, employing increasing numbers of variables 
and equations. Furthermore, the computed results were compared with 
measured data where such data were available. Lacking a full-fledged 
computer program the calculations were carried out with the aid of a 
programmable desk calculator, except for the matrix inversion, for which 
a general-purpose computer program was used. 

Table I summarizes the results of this work. The first observation that 
can be made is that, as expected, the convergence obtained for single and 
double posts is excellent. Three terms in the Fourier series for the post 
currents is all that is needed to obtain six place accuracy for the reac- 
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Number of posts A Bdy Bd, Winax xXx- B —X 
1 — 0.2 — 0 1.121835970 — 
1 _ .009450748398 
2 1.121835438 — 
3 a .009450749381 
4 1.121835438 — 
Meas. 1.12 .010 
0.4 0 .6546985053 — 
1 = .03659710000 
2 .6546719818 _ 
3 — .03659716655 
4 .6546719813 = 
Meas .655 .037 
2 0.25 = 0.2 0 1.121835969 — 
1 1.100680471 .009467748389 
2 1.100679708  .009467748828 
3 1.100679707 .009467799814 
4 1.100679707  .009467799806 
0.4 0 .6546985046 — 
1 .6071886432  .03659710003 
2 .6071426563 .038685942441 
3 .60714238386 .03685949132 
4 .6071423830 .038685949138 
3 0.25 0.2 0.2 0 .2599117670 — 
1 .2578995041 .01871192452 
2 .2578489230 .01872800993 
3 .2578488656  .01872801335 
4 .2578488652  .01872801340 
Meas. .255 .020 
0.4 0.4 0 .02634303527 — 
1 .02329626388 .07043107151 
2 .02238859942 .07063929772 
3 .02238444183 .07063993488 
4 .02238437265 .07063993658 
Meas.  .0265 074 


Table | 


2 





tances, which is more than enough for any technical application. The 
second observation is that, as was hoped, excellent convergence continues 
to exist for triple posts, even though in that case two sets of infinitely 
many unknowns are encountered instead of just one. Even for posts with 
susceptance values as high as B = 20 no more than four terms in the 
Fourier series are needed to obtain five-place accuracy. Presumably the 
analysis will converge even for four or more posts, but these arrange- 
ments are of little technical interest and thus probably not worth in- 
vestigating. Finally, when comparing the computed values with mea- 
sured data obtained with the aid of a very precise, computer-operated 
transmission measurement set,!° sufficient agreement is found to as- 
certain that the analysis is free from any fundamental error. 


APPENDIX A 


We study the following series 
2 e ~BV[Q2n-Drtt}?+2? eos {[(2n — 1)a + t]C} 


2B Ct = 2 [On —Drttpre 


n=—o 


(30) 
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with t as a real variable, B = O and real, C real, Rejz} > 0, Re 
{V[(Qn — 1)a + t]? + 24} > 0. This function is even in ¢ and also periodic 
in t with the period 27. For reasons which will become apparent later 
we wish to develop it into a Fourier series in f 


1 co 
—agt+ > az coskt (31) 
2 k=l 


Without going into the fairly laborious detail the result is 


fle, B,C,t) == y (-1* Kolz|VB2 + (CFR cos kt (32) 
k=—o@ 


for the conditions stated for Eq. (30) plus either B > 0 or B = Oand C 
~0,+1,4+2....Setting t = 0 leads to 


2 e BV (2n~1)?n?+2 eos [(2n — 1)C] 


2 Vana Date 
a (-1)* Kolz| VB? + (C+R)2|) (33) 


provided Rez} > 0, Re{V (2n — 1)2x? + z?} > 0 and either B > 0 and real, 
C realor B =0,C #0, +1,+2...and real. The validity of Eq. (43) can 
be extended to include Re{z} = 0 by analytic continuation. In doing this 
the points z = 0 and z = +)(2n — 1)z obviously have to be excluded, since 
at these points individual terms of the sums involved are not analytic. 
The result of the analytic continuation is for z > jA 


Se (“1 HPA VB? + (CFR) 


k=—o@ 
2 9—BV(2n-1)2x?—A? 
= 4] ——_———————— cos [(2n — 1) 34 
d 2» (2n — 1)2x2 — A? \ i 
with 
A > 0, real 
A. 4. 8H, OR a8 (34b) 
Tv 
argv (2n — 1)2x2 — A? = Oor ‘ 
and either 
B> 0, real 
C real (34c) 
or 
B=0 
C 40,+1,+2..., real (34d) 
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For the latter situation [Eq. (34d)] Eq. (34a) may be rewritten in the more 
rapidly converging form 

oo k 2) = 1 2 1 . co 

S (-LFHP(AIC + k|) = jon (cot 5A) +4 > 

hk=—a@ T 2 n=1 

Aco [2m = DC] 

(2n — 1)a V(2n — 1)2x? — A? [(2n — 1)ae + V(2Qn — 1)2x? — A? 

(34e) 

We also need a result corresponding to Eq. (34e) for C = N = 0, £1, 

+2.... We observe first that in the left hand sum the term k = —N has 
to be excluded for obvious reasons. Furthermore it is 


E> (-1)*HE(AIN + K|) =(-DN XS (-DRH(AIRI) 
=—0 k=—o 
kyon kx<0 


= 2(-1)N > (-1)FHP(AR) (35) 


k=1 
An alternate expression for the last sum is known (Ref. 6, p. 333). We 
get 


SX (-1)*HPAIN + k]) = (DN i i oie (C +1n “) 
oaks W 4a 


k#-N 


2 1 1 
ace) le —1)?a2— Ae all 
= (-1)N =! +9j= (c+in©) +45 > 
Tv Tv n=] 
A2 
(Qn — a(n — 12x? — A2|(2n — I) + VOn —- 1x — AZ | 
(34f) 


with N = 0, +1,+2...and Eas. (34b) in force. Note that C in this last 
formula is Euler’s constant. 


APPENDIX B 
We study for m = 1,2... the series 


fm(z, C, t) = 


foe] 


2m cos {| @n— De +e c—m>} 


(36) 
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with ¢ as a real variable with the range —z < t < z, C real Re{z} > 0, 
Re{vV[(2n — 1)x + t]? + z2}> 0. Analogous to the situation in Appendix 


A we wish to develop fm(z, C, t) + fm(z, C, —t) into a Fourier series 
a + >> ay, cos kt 
2 k=l 


over the range —z < t < z, thereby continuing it periodically beyond that 
range. Again omitting the fairly laborious detail we get for C ¥ 0, £1, 
= ee 


falZ C, t) bin (2; C, —t) 


a (-1)™+1[4S,, jCz)eim*/2 — 8K, (|C|z)(sgn C)™] 


- as (—L)ert 3 (—1)*[2S,,( (Cz + jkz)eim/2 
An k=1 
+ 28m (GCz — jkz)eim*/2 — 4K,,(|C + k|z){sgn (C + k)}™ 


24K C= lian (C= Des ES > (-1)m 
Tv 


XY (-DA[2Km([C + k]z){sgn (C +k} 
k 


=—o 


— eimn/25,,(jCz + jkz)| cos kt (37a) 
and forC = N =0, +1, +2... 
fm(2; N, t) + fn(z, N, —t) 
1 © 
= oe (-1)™ S° (-1)*[2K,(|N + klz){sgn (N + k)} 


kw—o 
k#—N 


. : 1 
—elmn/2S, (JNz + jkz)] cos kt + — (—1)% cos a cos Nt (87b) 
mr 


In these equations S,,(z) denotes Schlaefli’s polynomial (Ref. 5, p. 313). 
For ¢t = 0 this results in 


z™ cos | (an —1)rC - ms | 


Pa V(2n — 1)2x?2 + 22 [(Q2n — 1)e + V(2n — 1)202 + 22|™ 
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provided C ~ 0, +1,+2...,andin 


= z™ cos | (en — Dav ~ m5] 


a ~ ae? (—1)* [2K n(|N + k|z) {sgn (N + k)}" 


k#-N 


: . : 1 
—elmn/2 §,(GNz + jkz)] +—— (-1)% cos ue (38b) 
2nr 2 


for N = 0, +1, +2.... By working on the two series with Schlaefli’s 
polynomials the following alternative expressions are obtained 


i z™ cos | en — Dac - m5] 
= > (S1)" > (-1)*Km(|C + k|z)[sgn (C + k)]™ 
T k 


+ einen So mn 


on\ m—2n 


jz 


qm-2n-1 1 
= sin ae (39a) 
5. (-1)'Kay—-1(|N + k]z) sen (N +k) = 0 (39b) 
aN 
fn) z2A(—1)NtA 


2 Vn a1 Felon — 1) + Vn = Pee FD 
SS (AK AUN Phe eee 
pee re 


an 
k#-N 
A (A+ — 1224-1 — 1)Bop 202k 
~ 4b (yyner om t (=) (39c) 
20 k=1 (A — R)'(2k)! 


where, again, C # 0,+1,42...,real,N =0,41,42...,m=1,2..., 
A=1,2...,Re{z}>0, RelV(2n — 1)?x2 + z?} > 0. Following the same 
argumentation as in Appendix A the validity of Eqs. (39) can, by analytic 
continuation, be extended to include Re{z} = 0 with the exception of z 
= QO and z = +j(2n — 1)z. The result is for z — jA with A > 0 and after 
some rearrangement 
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5 (-1)H2(|C + k}A)[sen (C + k)]™ 
ea 


A™ cos | (en —1)acC - | 


= Ay = ae ee eS 
Ae V(2n — 12x? — A2[(2n — 1) + Vn — 12a? — API” 





. 1 (m—n—1)! /2Qr\m-2n 
—= —]1 m— Pte IEE ek ES eS peas 
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7A 4 
ee sin _ (40a) 
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me 
X (-LEH®) (|N + RIA) = 4j(-1)*™ 
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2 A2m 
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valid form =1,2...,N=0,+1,+2...,C 40, +1, +2 and real, A > 
0, realand A ¥ a, 3a, 5a... , and with arg {V/(2n — 1)?x? —A?} = 0 or 
7/2. Eq. (40a) can be written in the following more convenient form 


(LAH 2(|C + k[A)[sen (C + ky] 


k=—-—@ 
Am cos| (2n —1)rC—m 5 


~ 4) 2 V@n aia? = Al(Qn — lyr + Vn ~ 1) AQ 

AS eee Qn 

Grace 

subject to the same restrictions as those enumerated for Eqs. (40). The 
polynomials h,(u) appearing in this equation are defined by 


m—2n 
Ar =0 n'(m — In - 1)! ) Am-—2n—1(cos Cr) (41) 








dad” 1 
h,(u) = (-1)" | sin nt+ly . | (42) 
dx" sin x Jcos x =u 
and can be shown to satisfy the following recursion formula 
which begins with 
dhy,- 
hn(u) = nuh,-;(u) + 1 — a (48a) 
ho(u) = 1 (43b) 
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It appears impossible to give a closed form expression for these poly- 
nomials, but their coefficients can easily be calculated by the following 
scheme which is a consequence of the recursion formula 


1 ms 
; wee ks | 
3 oe Te 
4 ae oe Sy 
6 se ee a ae oe ee, 
Le., it is 
ho(u) =] 
hy(u) =u 
hou) =1+u2 


h3(u) = 5u + u3 

hau) = 5+ 18u?2+u4 
It can be shown, incidentally, that the sum of all coefficients of h,(u) 
is equal to n! and that the coefficients of uw® and u! are Euler’s numbers. 
It furthermore appears, but has not been proven, that the coefficients 
are equal to those in Table 7.2.2 of Ref. 7, p. 260, the “number of per- 
mutations of the first N natural numbers with t, runs up.” 
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Measurements of Loss Due to Offset, End 
Separation, and Angular Misalignment in Graded 
Index Fibers Excited by an Incoherent Source 


By T. C. CHU and A. R. MCCORMICK 
(Manuscript received July 20, 1977) 


Transmission losses versus fiber end offset separation, and angular 
misalignment of graded index fibers excited by an incoherent source, 
have been measured in two independent experiments. The measure- 
ment setup, fiber diameter, and length were different in the two ex- 
periments, yet the measurement results are strikingly similar. The loss 
measurements clearly show that transverse offset is much more critical 
in connector and splice design than angular misalignment and end. 
separation. Two-tenths of the fiber core radius in transverse offset alone 
may cause 0.5 dB loss while one fiber core radius in axial separation 
combined with 1° in angular misalignment may cause 0.5 dB loss. 


l. INTRODUCTION 


It is essential to know the transmission loss caused by misalignment 
of the fiber ends in designing fiber connectors and splices. Graded index 
fibers are important to fiberguide transmission applications that require 
low dispersion characteristics. The study of the transmission loss caused 
by misalignment of fibers having graded index profiles is thus necessary. 
Theoretical investigations of the loss versus offset at zero axial separation 
have recently been published.!* Further studies of the problem—i.e., 
loss versus offset, end separation, and angular misalignment of graded 
index fibers—have been done experimentally.>-’? This paper presents 
the results of two separate experiments. 


ll, EXPERIMENTS 


The experiments were conducted independently in different labora- 
tory locations. The first experiment (Fig. la) yielded the loss versus offset 
and end separation only. The second experiment (Fig. 1b) included 
angular misalignment along with end separation and offset. In both 
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Fig. 1—(a) Coupling loss vs. fiber end misalignment measurement setup in the first 
experiment. (b) Coupling loss vs. fiber end misalignment measurement setup in the second 
experiment. 


experiments, a Burrus-type LED having a 50 wm diameter emitting 
surface was used. The LED in the second experiment was internally 
modulated whereas the first was not modulated. Microscope objectives 
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COUPLING EFFICIENCY IN PERCENT 
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(a) 


COUPLING EFFICIENCY IN PERCENT 
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(b) 


Fig. 2—(a) Coupling efficiency vs. normalized offset d/R at various separations s/R from 
the first experiment. (b) Coupling efficiency vs. normalized offset d/R at various separa- 
tions from the second experiment. 


were used to collect and focus the light into the launching fiber. Align- 
ment was achieved by using micropositioners. In both experiments the 
output of the receiving fiber was detected by a power meter and moni- 
tored by a digital multimeter. 

Graded index fibers were used in both experiments. The first exper- 
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Fig. 3—Loss in dB vs. normalized offset d/R, separation s/R, and angular misalignment 
a°/sin-! NAp. 


iment used a 50 wm diameter core/100 um diameter cladding fiber while 
the second used a 55 wm diameter core/110 um diameter cladding fiber. 
The indices of refraction of the core center and cladding.of both fibers 
were 1.472 and 1.458 respectively. A 1.83 m length fiber was used in the 
first experiment and a 20 m length in the second. 

In both cases the experiments began by optimizing the power output 
from the fibers. The fibers were then cut in the center and aligned using 
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0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 
NORMALIZED OFFSET a/R 


Fig. 4—Loss vs. normalized offset d/R at various normalized separations s/R from the 
first experiment. 
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LOSS IN DECIBELS 





ANGULAR MISALIGNMENT @ IN DEGREES 


ee 5—Loss vs. angular misalignment a in degrees at various normalized separations 
s/R. 


the micropositioners, and index matching fluid (glycerol) was applied 
to the joints. The power output in the first experiment was measured 
to be 0.01 dB less than the maximum power obtained before the fiber 
was cut. This figure was 0.07 dB in the second experiment. 

The loss versus offset measurement (in both experiments) at zero 
separation was done by offsetting one fiber end (at the butt joint) to the 
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Fig. 6—Loss vs. angular misalignment a in degrees and various normalized offsets d/R 
at constant separation s/R = 1. 
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LOSS IN DECIBELS 
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Fig. 7—Loss vs. angular misalignment a in degrees and various normalized offsets d/R 
at constant separation s/R = 2. 


other by known amounts and the power output of the receiving fiber was 
recorded. This was repeated at normalized axial separations of 1, 2, 3, 4 
and 5. The normalized separation and offset are defined as s/R and d/R, 
where s is the axial separation in ym, d is the offset in um, and R is the 
fiber core radius in wm. The loss-versus-angular misalignment mea- 
surement (in the second experiment) began with aligning the receiving 
fiber with the center of rotation of the table so that the angular alignment 
could be changed while the axial separation and offset remained con- 
stant. The angular alignment was varied from —3° to +8° in increments 
of 0.2° at normalized axial separations of 1, 2, 3, 4 and 5. 


Ill. RESULTS 


The coupling efficiencies in percentage-versus-normalized offset at 
six normalized axial separation are shown in Fig. 2a and b (first and 
second experiment, respectively). The facts that the results of two ex- 
periments are very similar and the transverse offset is by far the more 
important parameter can be seen in Fig. 3, in which the loss-versus- 
normalized offset d/R at zero separation, the loss-versus-various nor- 
malized separations s/R at zero offset, and the loss-versus-normalized 
angular misalignment a°/sin~!NAg at constant separation S/R = 1 are 
plotted. Here NAg = Vn? — nj and n; and ngare the index of refraction 
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0.5 dB LOSS 0 @% sin~1 NAg =0 
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——--— 1.5 dB LOSS D = 0.174 


NORMALIZED SEPARATION s/R 





0 0.1 0.2 0.3 0.4 0.5 
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Fig. 8—Constant loss lines as the results of fiber end offset d/R, separation s/R, and 
angular misalignment «°/sin—! NAo. 


of the fiber core center and cladding, respectively. The difference be- 
tween the two experiments at zero offset and zero separation is due to 
the different amount of initial misalignment of the fiber ends after it was 
broken and butt-jointed. In the first experiment, the power output from 
the receiving fiber was 0.01 dB below the maximum power obtained 
before the fiber was broken; this figure was 0.07 dB in the second ex- 
periment. The designers of fiber connector or splice will be interested 
in the region where loss is low. Figure 4 shows the loss in dB versus small 
‘ofset (d/R <0.8) at various separations. Figure 5 shows the loss due to 
angular misalignment at normalized separations of 1 through 5. Figures 
6 and 7 show the loss due to angular misalignment and offsets at nor- 
malized separations of 1 and 2, respectively. Figure 8 shows constant loss 
curves as caused by various kinds of misalignment. As an example, 
consider various kinds of misalignment that all produce 0.5 dB loss: a 
normalized offset of 0.2 alone; a normalized separation of 2 alone; a 
normalized angular misalignment of 0.087 and normalized separation 
of 1; a normalized offset of 0.1 and normalized separation of 1. Designers 
of connectors will have to pay very close attention to offset, then angular 
misalignment and separation, respectively. 


Iv. CONCLUSION 


Loss versus various kinds of misalignment of two ends of the same 
fiber has been measured in two independent experiments. The mea- 
surement setup, fiber diameter, and length were different in the two 
experiments, yet the measurement results are strikingly similar. 
Transverse offset is shown to be the most critical parameter in the design 
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of fiber connectors and splices. The present results provide only the 
minimum loss that would arise in actual fiber connectors and splices, 
since additional losses might be caused by other factors such as fiber 
diameter and index profile mismatch. 
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Design and Experimental Optimization of a 
Canister Antenna for 18-GHz Operation 


C. A. SILLER, JR., P. E. BUTZIEN, and 
J. E. RICHARD 


(Manuscript received May 27, 1977) 


The design and experimental optimization of a canister antenna 
for operation in the 17.7 to 19.7 GHz frequency band are described. The 
canister is specifically designed to accommodate the antenna, trans- 
mitter and receiver units, and to be aesthetically innocuous in its en- 
vironment. The basic antenna configuration, that of a shielded inverted 
periscope, is reviewed. Details of the constituent parts of the antenna, 
including dual mode feed, paraboloidal reflector, mirror, radome, and 
microwave absorber are presented. The influence that each of these 
items plays in determining net electrical performance is identified, 
including (where appropriate) steps taken to achieve electrical opti- 
mization. The antenna is shown to afford excellent sidelobe suppres- 
sion, azimuthal plane cross polarization discrimination in the mid to 
upper 30 dB range, and a return loss of better than 23 dB. The gain 
efficiency ts approximately 62 percent and is essentially polarization 
independent. 


I. INTRODUCTION 


The design and experimental optimization of a canister antenna for 
point-to-point operation in the 17.7 to 19.7 GHz frequency band are 
described in this paper. The antenna, which has been briefly described 
earlier,! was designed for a digital radio system (DR 18A).? The system 
will use a 274-Mb/s quaternary phase shift keying (QPSK) modulation 
to provide eight radio channels (seven working, one protection). Each 
channel provides 4032 voice circuits. Anticipated typical repeater 
spacings will be 2.4 to 7.2 kilometers.? The canister is specifically de- 
signed to accommodate the antenna, transmitter and receiver units, and 
to be aesthetically innocuous in its environment. 

The initial antenna concept was first proposed by Crawford and 
Turrin‘ in 1969. Subsequently, the authors, drawing in part upon the- 
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Fig. 1—DR 18A repeater site located in Methuen, Massachusetts. 


oretical and experimental work of colleagues at Bell Laboratories, de- 
signed an antenna for incorporation into a mast-supported, integrated 
canister antenna concept. A photograph of an existing repeater site in 
Methuen, Massachusetts, which may be characterized as “‘typical” in 
appearance, is presented in Fig. 1. 

As depicted in the cut-away view in Fig. 2, the antenna is basically a 
shielded inverted periscope consisting of a paraboloidal reflector, feed 
assembly, mirror and inclined radome. The feed, the end of which is lo- 
cated at the focal point of the parabolic dish, illuminates the paraboloid 
with a spherical wave. Upon reflection, the energy is converted to a plane 
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Fig. 2—Schematic depiction of canister antenna showing salient features. 


wave which is subsequently incident upon a mirror nominally inclined 
at 45° to the paraboloidal axis. The energy is then reflected from the 
mirror (which is tiltable to deflect the beam up or down slightly) and exits 
through the circular aperture. Low wide-angle sidelobes are achieved 
by relying upon the absorber-lined canister to afford electromagnetic 
shielding and reduce edge diffraction at the aperture. It may also be 
noted from this figure that the canister serves as housing for the mi- 
crowave networks as well as transmitter and receiver modules. This 
paper deals specifically with the influence of the various components 
of the antenna on its electrical performance and (where appropriate) 
steps taken to achieve electrical optimization. 

Electrical objectives to be attained in the antenna design are dictated 
by the radio system. After a study into the regional variation of rainfall 
and its role in setting repeater spacings, the gain objective was estab- 
lished as approximately 42 dB above isotropic at the low edge of the 
frequency band since gain increases with frequency. This gain objective 
included transmission loss through the radome. The radiation pattern 
determines the maximum number of radio paths which may converge 
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at a point. With a goal of eight such paths and a desired discrimination 
of 60 dB, the pattern was to drop to < —60 dB at 45° in azimuth. Finally, 
the return loss of the dish and window was to exceed 26 dB, i.e., voltage 
reflection <0.050. 

Considerations involved in the selection of a suitable feed are reviewed. 
Since the influence of the feed pattern on gain cannot be divorced from 
the selection of a paraboloid, the inter-relationship of feed taper and dish 
f/D ratio is delineated by computing both illumination and spillover 
efficiencies to obtain net antenna efficiency. The optimum position of 
the feed with respect to the mirror is also explored, and the influence of 
the hole around the feed on antenna radiation patterns is elucidated. 
The wide-angle radiation suppression of the antenna is shown to be 
acutely dependent upon the use of broadband microwave absorber. Since 
antenna directivity is also affected by the presence of a radome, the tilt 
of the mirror, and the correct positioning of the feed, the role which these 
items play in influencing radiation suppression is assessed. Measured 
return loss and the sources which give rise to reflected power are also 
discussed. The paper is concluded with a summary of the electrical 
characteristics of the antenna design. . 


ll. ANTENNA FEED 
2.1 Properties of dual mode feeds 

An objective in the design of this antenna was the attainment of cir- 
cularly symmetric feed patterns for the illumination of the paraboloid. 
It may readily be shown that such a feed, with an illumination function 
characterized by 


Eo = AF(@) sin @ (1) 
Ey = AF(@) cos ¢ (2) 
|E| = (Eo? + E42)? = AF(8) (3) 


where 6, ¢ are the conventional spherical coordinates, results in a com- 
pletely linearly polarized field distribution after reflection from a full 
paraboloid. As a consequence, there are no cross polarized fields in the 
aperture. In addition, this feed illumination readily permits one to design 
for optimum polarization-independent gain. 

Fields like those given in eqs. (1) and (2) are achieved by using dual 
mode and hybrid mode feeds. For this antenna, a dual mode feed was 
found to be effective, as well as simpler and less costly to produce than 
the hybrid mode feed. Chu® has noted that the observed radiation pat- 
tern of a dual mode feed is approximated closely by the H-plane pattern 
of an open-ended circular waveguide excited by the dominant TE9 
mode: 
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EDGE ILLUMINATION TAPER INCLUDING SPHERICAL ATTENUATION, 
dB REFERENCED TO ON—BEAM VALUE 


3.0 3.5 4.0 4.5 5.0 5.5 6.0 6.5 
u(FEED CIRCUMFERENCE IN WAVELENGTHS) 


Fig. 3—Characterization of feed illumination, including spherical attenuation to a 
paraboloidal surface, for various values of the subtended dish half-angle, 4. 


J,(u sin 6) 


7 | sin ali 
1.841 

where u = xd/X, d is the diameter of the feed waveguide, \ is the wave- 
length of the applied signal, and @ is the angle measured from the axis 
of the feed. The region of validity for eq. (4) is  S u S 2z, or equiva- 
lently, the waveguide diameter is limited to one to two wavelengths. 

The field intensity at the edge of a paraboloidal reflector may be 
computed as a function of u by using eqs. (38) and (4) and including 
spherical attenuation to the paraboloidal surface, cos?(6/2). The latter 
comes from the fact that the equation of a paraboloid in spherical 
coordinates is r = 2f/(1 + cos@). The results of such a computation are 
presented in Fig. 3 for various values of 6,, the subtended half-angle of 
the paraboloid. The use of 6, instead of f/D is preferred by the authors 
because of its simpler physical interpretation. It is noted that f/D = 
1/(4tan 6,/2), where f and D are the focal length and reflector diameter, 
respectively. From Fig. 3, selection of any two of the following three 
variables uniquely specifies the third: edge taper in decibels, feed di- 
ameter in wavelengths, and subtended half-angle in degrees. The utility 
of this figure arises in optimizing the gain efficiency of the antenna, the 
subject of the next section. 


F(6) = [V1—(1.841/u)2 + cos 6] - (4) 
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2.2 Relationship of feed taper to efficiency 


The gain efficiency of a center-fed full paraboloid may be broken down 
into an illumination efficiency and spillover efficiency (as in Ref. 6). The 
concept of an illumination efficiency is well known; spillover efficiency 
specifies that fraction of the total power radiated by the feed which is 
intercepted by the paraboloidal dish. Denoting these terms by 7; and 
ns, respectively, they are mathematically evaluated from the feed illu- 
mination function. Namely, 


Qr 06 6 
f F (0) cos? (5) d> 
6=0 J0=0 
er (5) 


At {Peet} as 
» Soa she ne) aes ()| 


In eq. (5), the projected aperture area is 


2 











1 — cos 4, 
1+ cos 4 


and the integration is performed over the projected aperture of the re- 
flector with differential area element 


Ap = 4xf? 


_ f* sind 


O\4 
(ost 


The spillover efficiency is given by 


dd d0d¢ 


9 
f, [F(0)]2 sin 646 
0 
Ns = (6) 
a/2 
f, [F'(0)]? sin 0d0 


The total antenna gain efficiency is expressed by the product, viz. ng = 
nins. The evaluation of yn, given by eq. (6) is approximate insofar as it 
neglects back radiation from the feed. Radiation into the hemisphere 
a/2<0<7,0< ¢ S 27 is not accurately described by eq. (4), and is so 
low as to contribute virtually nothing to the evaluation of 7. 

The terms ng and n, have been computed as a function of dish edge 
illumination (spherical attenuation and feed taper at an angle 6,) for 
a variety of subtended half-angles. (Selection of a dish edge illumination 
and half-angle implicitly determines a feed diameter as shown in Fig. 
3). Typical aperture efficiencies, expressed in decibels down from full 
area gain, are presented in Fig. 4. The figure shows that for a specified 
angle 6,, the optimum efficiency occurs for edge tapers of —11.5 to -12.5 
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INCLUDING SPHERICAL 
ATTENUATION, dB 


Fig. 4—Antenna efficiency as a function of edge illumination for various values of 
subtended dish half-angle, 65. 


dB. Additionally; for optimum taper, the least loss in gain occurs for a 
half-angle of approximately 35°. 

While it would appear from the above analysis that a half-angle of 35° 
should be selected, two additional considerations are germane. First, 
from an electrical standpoint, edge tapers of —11.5 to —12.5 dB for small 
values of 6, necessitate relatively large feed diameters (See Fig. 3). Large 
feed diameters are deleterious because they can support higher order 
modes, the effect of which will be demonstrated subsequently. They also 
increase feed blockage in the antenna, thus deteriorating sidelobe sup- 
pression in the radiation pattern. In regard to this latter point, theoretical 
analyses of this antenna by Anderson’ indicate that the sidelobe levels 
in that angular region where the imaged paraboloid would be visible to 
an observer are insensitive to field illumination at the edge of the dish, 
an observation that has been experimentally substantiated elsewhere. 
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within Bell Laboratories. He concluded that the level of radiation sup- 
pression is principally influenced by feed blockage. This conclusion is 
validated by an experimental study discussed later in the paper. The 
second point arguing against a 35° half-angle is aesthetic, in that such 
a paraboloid would require a canister more slender and tall than con- 
sidered appropriate. Feed blockage is estimated to have a negligible ef- 
fect on the gain of this antenna. 

There is also a reason why angles greater than 45° are not especially 
desirable. The geometry of the antenna dictates that for angles greater 
than 45°, the feed must protrude above the mirror. Yet experimental 
evidence indicates that this, too, manifests itself as feed blockage, and 
can deteriorate sidelobe suppression. For these reasons, a subtended 
half-angle of 40° appeared reasonable. 

As stated in the Introduction, the gain objective of this antenna was 
approximately 42 dBi at 17.7 GHz. Experience indicates that it is diffi- 
cult to realize gain within 0.5 dB of theoretical, so the antenna was de- 
signed to theoretically provide a gain of 42.5 dBi. Using Fig. 4 with 0, 
= 40° implies that this gain is achieved by selecting a reflector diameter 
of 0.813 meters and f = 0.559 meters. Because of limitations in available 
tools for spinning the dish, a focal length of 0.536 meters was actually 
used. For a dish of 0.813-meter diameter, this corresponds to 0, = 41.4° 
or f{/D = 0.66. Tolerances of the spun aluminum paraboloid were 0.0254 
centimeters rms and a peak deviation of less than 0.0305 centimeters 
from the design surface. At 19.7 GHz, the highest anticipated operating 
frequency, these correspond to \/60 and 4/50, respectively. 


2.3 Feed designs and performance 

Two dual mode configurations (as depicted in Fig. 5) of square and 
circular cross section with a variety of aperture sizes were considered. 
The principle of operation of these dual mode configurations is that of 
achieving a two-mode mixture of fields at the feed aperture such that 
the edge currents are nearly zero.® In Fig. 5, we may consider a TE, mode 
(for the case of circular cross section) propagating upward from the 
uniform waveguide at plane a. This wave encounters an abrupt change 
in cross section at plane b such that the electric field is bent to maintain 
a vanishing tangential component at the conducting walls. An axial 
component of the total electric field (F,) is thereby generated and by 
plane c conversion of some of the energy to the TM, mode has been 
accomplished. The distance from c to d provides the proper phasing of 
the two modes at the feed aperture since the two modes propagate at 
different phase velocities. Since the two modes do travel with different 
phase velocities, the feed does have a bandwidth limitation. Nevertheless, 
the feed provides proper performance across the 17.7 to 19.7 GHz 
band. 
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Fig. 5—Dual mode feed cross section. 


Two sizes of dual mode feed were built and tested in both the round 
and square cross section geometries. The feeds of square cross section 
were found to have higher cross polarized fields in the +45° planes than 
the circular feeds, and were therefore no longer considered. 

Two sizes of feed were built because the aperture diameter-to-wave- 
length ratio determines the illumination taper of the feed. As noted 
above, the canister antenna was designed with a subtended half-angle 
of approximately 41.4° between the feed and rim of the paraboloid. To 
attain optimum gain efficiency for this 0.66-f/D paraboloid fed by a dual 
mode feed requires an edge illumination (including spherical attenuation 
to the paraboloidal surface of just over 1.1 dB at 41.4°) of —12.1 dB (from 
Fig. 4). Using Fig. 3, such a feed requires u = 5.4 and hence has a diameter 
of 2.92 centimeters at 17.7 GHz. This diameter, however, allows four 
higher-order modes to propagate between planes c and d (Fig. 5). 
Therefore, a smaller version with a 2.44-centimeter diameter which just 
cuts off beyond the TM, mode was also built. Both of these models had 
cone angles of a = 30°. 

It was found that due to higher order modes (TM9,, TE?, TES, and 
TM®) which could propagate, the larger diameter feed produced H-plane 
radiation patterns which fell off much too rapidly over a large part of the 
frequency range of interest. This effect is depicted in Fig. 6a. 

The 2.44-centimeter diameter dual mode feed was extensively tested 
and was subsequently chosen as the feed for this antenna. The length 
of the drift section was adjusted empirically to 3.81 centimeters to obtain 
the best match of E-plane and H-plane taper at mid-band. The feed il- 
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Fig. 6—Dual mode feed tapers as a function of frequency: (a) H-plane measured edge 
tapers of a 2.92-centimeter diameter dual mode feed; (b) H-plane measured and theoretical 
edge tapers of a 2.44-centimeter diameter.dual mode feed; (c) E-plane measured edge tapers 
of the 2.44-centimeter diameter dual mode feed. 


612 THE BELL SYSTEM TECHNICAL JOURNAL, MARCH 1978 
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Fig. 6 (continued) 


lumination tapers (no spherical attenuation) as a function of frequency 
for the H-plane and E-plane are shown in Figs. 6b and 6c, respectively.* 
Each point represents the power average of the edge taper at 41° on both 
sides of the feed radiation pattern like those shown for the E- and H- 
planes at 17.7 GHz, 18.7 GHz, and 19.7 GHz in Figs. 7a, 7b, and 7c, re- 
spectively. The patterns in Figs. 7a, 7b and 7c were measured on a small 
indoor range. Note from Figures 6b and 6c that E-plane and H-plane 
tapers are the same only near the design frequency, implying best illu- 
mination symmetry at that point. As the signal frequency departs from 
the design frequency, the illumination in the aperture of the paraboloid 
becomes increasingly asymmetric, an assessment of which may be ob- 
tained from these figures. For example, note from Figures 6b, 6c, and 
7a that at 17.7 GHz, the H-plane taper at the edge of the dish is measured 
to be approximately —9.8 dB, while at this same frequency the E-plane 


* As Figs. 6b and 6c show, the final antenna feed was tested at more frequencies in the 
H-plane than in the E-plane. This was done to assure that difficulties which were manifest 
in the over-moded feed (see Fig. 6a) did not occur in the smaller feed. 
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(b) 
Fig. 7—E- and H-plane radiation patterns of the 2.44-centimeter diameter dual mode 
feed: (a) 17.7 GHz; (b) 18.7 GHz; (c) 19.7 GHz. 


taper is approximately —7.6 dB. Figure 6b also contains theoretical 
H-plane tapers computed from eq. (4). As the data reveals, at 41° mea- 
sured tapers exceed the theoretical values by approximately 1 dB. This 
observation is also valid for other dual mode feeds with which the authors 
are familiar. Note that the measured taper at 18.7 GHz is very close to 
the optimum (see Fig. 4). 
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Fig. 7 (continued) 


Cross polarized patterns were also measured in both the E and H 
principal planes and the +45° planes. The patterns were generally 
room-reflection limited to better than a 40-dB dynamic range, thereby 
inferring the capability of measuring cross polarized fields to that level. 
Measurements made at more than ten frequencies within the 17.7 to 
19.7-GHz band indicate that near boresight the cross polarized response 
approaches the upper 30 dB range in the +45° planes. Because the an- 
tenna is circularly symmetric from a geometrical optics viewpoint, the 
reflector system itself adds little to cross polarization conversion except 
for small contributions from surface roughness and feed scattering. The 
antenna polarization properties are essentially set by the feed itself. 

The theoretical gain efficiency of the antenna has been calculated for 
41.4° and is plotted in Fig. 8 as a function of field taper at the rim of the 
paraboloid. Also indicated on this curve is the taper achieved with the 
2.44-centimeter diameter feed at 17.7 GHz, 18.7 GHz, and 19.7 GHz 
(obtained from a robust linear regression of data in Figs. 6b and 6c). As 
this figure indicates, the feed should afford near optimum performance 
since the highest efficiency is predicted for the upper portion of the band 
where rain attenuation can be expected to be most severe. 

The gain of this antenna without the radome was measured on an 
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Fig. 8—Theoretical efficiency of prototype design as a function of illumination taper 
at edge of dish. 


outdoor range using the comparison method and a standard gain horn 
as a reference calibration. The measured gain, relative to an isotropic 
radiator, and aperture efficiency are stated in Table I. Observe that 
higher gain for the upper portion of the frequency band has been 
achieved. 

Experience obtained during an earlier CW experiment using this an- 
tenna concept has indicated the necessity to avoid insects and other 
foreign matter getting into the feed and altering electrical performance. 
Precipitation, on the other hand, is not expected to enter the canister 
since the antenna body and window are effective in this regard. In its 


Table |—Measured gain (over isotropic) for 18-GHz antenna 
without radome 


Frequency, (GHz) Gain, (dBi) Aperture efficiency, % 
17.7 41.5 62.2 
18.7 42.1 64.0 
19.7 42.4 61.8 
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customary orientation, the feed is pointing up (see Fig. 2). A variety of 
feed plug configurations designed to inhibit foreign matter from falling 
into the feed or accumulating at the opening were fabricated and tested. 
Each of the plugs considered was formed of commercially-available, 
expanded, closed-cell, polystyrene with a relative dielectric constant ¢; 
< 1.03 and loss tangent 0.002. Each plug was tapered on the outside and 
inside to inhibit the accumulation of foreign matter and minimize re- 
flected power, respectively. The plugs were cemented in place with an 
electrically transparent silicone rubber adhesive and then tested for their 
influence on feed patterns, return loss, and antenna gain across the band. 
The plug configuration shown in Fig. 9 influenced the feed pattern least, 
was invisible in return loss and gain measurements, was simplest in de- 
sign, and was therefore selected. 


lll, RELATIONSHIP OF FEED TO MIRROR 


The topics considered in this and later sections deal primarily with 
effects which influence the antenna radiation characteristics. It is 
therefore appropriate to comment briefly on the antenna range facilities 
for 18-GHz measurements. Pattern and gain measurements are made 
on a ground reflection range. The source antenna is located close to the 
ground so that the field illumination across the aperture under test is 
uniform within one decibel (field uniformity is frequently verified by 
probing the field in front of the antenna). Use of a pulse transmitter (with 
a pulse duration of 200 nsec) and gated receiver (which samples the peak 
amplitude during a 50 nsec interval on the leading edge of the trans- 
mitted pulse) assures a measured response free of spurious reflections. 
The dynamic measurement range at 18 GHz is 70 dB. 


3.1 Axial movement of feed relative to mirror 


The canister antenna was designed with the flexibility of allowing 
feed-paraboloid translation with respect to the mirror. As shown in Figs. 
10a and 10b, the feed and paraboloid may be moved in unison (providing, 
of course, that the feed is always kept at the focal point of the dish) with 
the electrical aperture of the antenna remaining fixed. Salient antenna 
dimensions are shown in Fig. 10c. 

Moving the feed with respect to the mirror has a pronounced effect 
upon the antenna radiation pattern. That angular portion of the radia- 
tion pattern for which the imaged paraboloid is visible, is influenced more 
by feed blockage than dish edge illumination (hence the effort to set feed 
taper for high efficiency). Minimum feed blockage is achieved by se- 
lecting a feed with small diameter, and keeping the feed from protruding 
well above the mirror where it would act as a scattering obstacle. On the 
other hand, a feed close to the mirror offers the potential for two dele- 
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Fig. 9—Dual mode feed: (a) photograph showing insect seal; (b) diagrammatic depiction of seal. 
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Fig. 10—Schematic depiction of antenna concept: (a) feed-paraboloid in lower-most position; (b) feed-paraboloid 
elevated; (c) major dimensions. 


terious effects. First, a feed too low will allow grazing illumination of the 
mirror near the hole around the feed, and concomitant pattern degra- 
dation. Second, experiments reveal that if the feed directly illuminates 
the mirror with excessive energy, then the illumination on the dish will 
be vertically asymmetric in amplitude.? This latter effect would also 
cause pattern degradation, possibly accompanied by deteriorated cross 
polarization discrimination. 

Optimum feed location was selected by an experimental study of the 
influence of feed location on antenna radiation patterns. Measurements 
were made at three frequencies (corresponding to the center and ex- 
tremities of the 17.7 to 19.7 GHz band) for all polarization states* and 
differing feed locations. The results, a typical example of which is de- 
picted in Fig. 11 as a smoothed radiation pattern,' clearly suggest that 
a low-profile feed affords the best performance. Therefore, the feed is 
placed low enough so as not to mask the horizontally directed plane wave, 
and yet not so low as to allow its spherical wavefront to illuminate the 
mirror too strongly. At its centerline, the feed axially protrudes above 
the top surface of the mirror approximately 2.08 centimeters. 


3.2 Influence of hole surrounding feed 


As shown in Fig. 2, the dual mode feed protrudes up through a hole 
in the mirror. The mirror itself is constructed of commercially available 
aluminum plate 0.953 + 0.013 centimeter thick. The surface is flat within 
0.038 centimeter peak over a 1.22-meter span. An aluminum frame is 
epoxied to the underside to inhibit the plate from sagging under its own 
weight. The hole in this mirror manifests itself as “feed blockage” and 
influences the antenna radiation pattern. 

A typical set of radiation patterns which exhibit the influence of this 
hole on radiation suppression is shown in Fig. 12. The three patterns 
correspond to a mirror with 4.06- and 5.08-centimeter projected diameter 
holes, and the hole around the feed carefully closed with conducting tape. 
As these patterns indicate, the presence of the open annulus caused 
raised sidelobes in the vicinity of 24° to 36°. Data acquired from ex- 
tensive experimental measurements generally suggests the benefit of 
making the annulus as small as possible. It is appropriate to add that the 
annulus cannot be made arbitrarily small since provision must be made 
for allowing tilt in the mirror. Therefore the hole is designed to have an 
elliptically shaped projection with a minor diameter no larger than 


* Horizontal polarization transmitted—Horizontal polarization received (HH), Hori- 
zontal polarization transmitted—Vertical polarization received (HV), Vertical polarization 
transmitted—Vertical polarization received (VV), and Vertical polarization transmit- 
ted—Horizontal polarization received (VH). 

+ Smoothed radiation patterns are prepared by the commonly accepted practice of 
drawing a smooth line across the peaks in the detailed pattern, thereby forming an envelope 
of peaks. 
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Fig. 11—Influence of feed position relative to mirror on azimuthal radiation pattern 
envelopes. 


necessary for feed placement and orientation, and a major diameter large 
enough to allow reasonable tilting of the mirror. 


IV. MICROWAVE ABSORBER AND ANTENNA RADOME 
4.1 Absorber 

The inside of the canister antenna is fully lined with microwave ab- 
sorber. Experimental measurements indicate that absorber is necessary 


for the suppression of wide-angle sidelobe radiation. 
In optimizing antenna performance, a variety of absorbers were con- 
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Fig. 12—Influence of feed-mirror annulus on azimuthal radiation pattern envelopes. 


sidered, including 1.91-centimeter thick convoluted foam, and 5.08-cen- 
timeter thick “hair” absorber. Not only must a suitable absorber provide 
radiation suppression, but it must be resistant to environmental dete- 
rioration. The antenna considered in this paper will provide no absorber 
protection aside from inhibiting direct exposure to the outdoor elements 
(sunlight, rain, ice, etc.). Consequently, absorbers for this application 
should be heat resistant, not inclined to rapid organic decomposition, 
and impervious to water. Relative to this last point, while the antenna 
interior will not be exposed directly to rain, it is expected that condensate 
can form on the inside as the antenna “‘breathes”’. 

It is useful to list here the respective advantages and disadvantages 
of the absorbers considered. Since much of the absorber inside the an- 
tenna will be subject to grazing incidence of electromagnetic fields rather 
than normal incidence, the convoluted surface would appear to be de- 
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sirable since experimentation has confirmed its excellent absorption 
qualities for varying angles of incidence. In contrast, “hair” has a rela- 
tively planar surface and might be expected to yield poorer performance 
for grazing angles. The convoluted absorber is made of an open-cell foam 
material while the “hair” is an open mat. Normally the “hair” absorber 
would have no tendency to hold water, while the convoluted foam could 
pick up water in a manner analogous to a sponge. To remedy this effect, 
the foam was covered with a sprayed Hypalon* coating approximately 
0.076 millimeters thick. Hypalon in itself is an excellent coating because 
of its electromagnetic transparency and proven resistance to weathering. 
However, the coating was easily perforated by fingers during installation. 
Finally, “hair” was significantly less expensive than the convoluted 
material, or even flat foam absorbers of comparable thickness. 

As a first step in assessing absorber performance, radiation patterns 
were made with no absorber, partial absorber, and a complete absorber 
lining of the antenna interior. The resulting radiation patterns using 
“hair” absorber are depicted in Fig. 13a. As this figure indicates, the 
addition of absorber dramatically reduces the wide-angle radiation of 
the antenna. The next step in the evaluation was to run separate patterns 
with the convoluted and hair absorber at three frequencies within the 
band for a variety of linear polarizations. The results, an example of 
which is shown in Fig. 13b, suggest that convoluted absorber generally 
affords slightly better performance. However, the differences are minor 
and because of the cost and weatherability advantages of the “hair,” it 
is preferred in this application. 


4.2 Radome 


Radomes have been traditionally used on high performance line-of- 
sight antennas used in the 4-, 6-, and 11-GHz common carrier bands. 
These radomes are called “thin” because the 0.762- to 1.016-millimeter 
fiber glass membranes are only 4/30 thick at the highest frequency. At 
frequencies near 20 GHz, preliminary tests on the canister antenna in- 
dicate that from a pattern standpoint, a satisfactory thin radome would 
have to be less than approximately 0.305 millimeters in thickness. Such 
a thin radome would prove structurally inadequate. For that reason, 
attention was focused on half-wavelength radomes. 

A radome used in the tests to be described was a solid laminate con- 
structed of epoxy resin and a low-loss fiber glass called E-glass. The di- 
electric constant of the laminate was thought to be 4.0 so the radome was 
made 0.399 centimeters thick for electrical tuning at 18.7 GHz. Subse- 
quent measurements indicated that the radome was actually tuned at 
approximately 18.3 GHz, inferring a dielectric constant of 4.3 (this latter 


* Registered trademark of E. I. duPont de Nemours & Co., Inc. 
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Fig. 13—Influence of microwave absorber on azimuthal radiation pattern envelopes: 
(a) presence or lack of absorber; (b) type of absorber. 


value is in accord with the commonly accepted value for this composite 
material). The loss tangent is 0.016. 

Figures 14a and b depict representative results obtained with and 
without the radome on the antenna. Figure 14a indicates that mea- 
surements made near the tuned point show the radome to have little 
influence on the antenna radiation patterns. This is also true of the cross 
polarized response. Figure 14b presents measured results near the edge 
of the band. For this case the radome does perturb the principal polar- 
ization wide-angle radiation characteristics, though the sidelobe sup- 
pression still meets the design objectives. The common explanation of 
this effect is that energy is reflected back into the antenna and subse- 
quently reradiated. Indeed, measurements made on this radome indicate 
that specularly reflected power is 35 dB down at 18.3 GHz, but only 13 
dB to 14 dB down at 19.7 GHz. 


V. MIRROR TILT AND FEED POSITION SENSITIVITY 
5.1 Mirror tilt 


Pairs of antennas used for point-to-point transmission are carefully 
oriented in elevation and azimuth to electrically point exactly at each 
other. For the canister antenna with relatively narrow beam width (3 
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Fig. 183— (continued) 


dB beam width of 1.33°), this orientation is performed by peaking the 
received signal while adjusting the azimuthal and elevation angle. The 
azimuthal orientation is accomplished by rotating the entire antenna 
in its supporting cross-arm (see Fig. 1). The elevation angle is adjusted 
by rotating the mirror about the horizontal axis so as to tilt the beam up 
or down. It is readily shown that as the mirror is tilted through the angle 
6m, the beam moves 4, = 26m. The overall influence of reflector tilt on 
gain and pattern is covered in this section. 

Reference radiation patterns were first measured with the mirror in 
the “nominal” 45° position (beam horizontal). Patterns were then 
measured with the mirror in a “tilt up” and “tilt down” position of ap- 
proximately 0,, = +£1.5°. In both cases the entire antenna was tilted 3.0° 
in the opposite direction to compensate for the beam tilt. This is done 
to maintain bore-sight beam orientation between the source and an- 
tenna. These measurements then establish the effect of the internal 
interaction of the plane reflector with other antenna parts. It should also 
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be noted that by tilting the antenna canister to compensate for reflector 
tilt, the patterns so obtained do not quite lie in the azimuthal plane, 
though the difference is slight. 

As expected, it was observed that tilting the mirror causes a decrease 
in received signal level, but this signal is completely restored by com- 
pensatory antenna tilt. Therefore, gain changes do not occur with modest 
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Fig. 14—Influence of half-wavelength solid laminate radome on azimuthal radiation 
patterns: (a) 18.0 GHz; (b) 19.7 GHz. 


beam tilt. The tilt experiments were done with and without a radome 
on the antenna, and for both polarization states of the antenna. Tilting 
the mirror is found to have almost no influence on the antenna sidelobe 
response. At most, 3 or 4 dB changes are noted at —55 dB levels near 30°. 
These small changes in sidelobe level are felt to be of little consequence 
because they are so far down from the main beam and do not change the 
angle at which the patterns drop beneath —60 dB. 
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5.2 Feed position sensitivity 


Feeds with diameters commensurate to a wavelength have phase 
centers located approximately in the plane of their aperture. Therefore 
the dual mode feed used in this antenna is positioned to have its aperture 
coincident with the focal point of the paraboloidal reflector. Neverthe- 
less, it is of interest to determine the sensitivity of feed position to deg- 
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Fig. 14— (continued) 


radation in antenna performance. Three conditions were examined: feed 
skew (feed axis inclined to paraboloidal axis of symmetry), feed linear 
displacement (feed and paraboloidal axis parallel, but feed axially or 
laterally displaced), and feed-polarizer twist. Briefly the tests show that 
up to 1° of feed skew, axial movement of up to 0.25 centimeter, and lat- 
eral displacement of up to 0.25 centimeter have little or no discernible 
influence. Feed-polarizer twist of up to 3° has no effect on gain, a mini- 
mal effect on pattern (first sidelobe increased 1 dB), but as expected, 
does deteriorate cross polarization discrimination. 
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These tests indicate that antenna performance is not overly sensitive 
to feed position, and allow for rather lenient support bracket tolerances 
which are easily maintained at minimal expense. 


VI. ANTENNA RETURN LOSS 


Transmitted power which is reflected back to the feed manifests itself 
as antenna return loss. The sources of reflected power in this antenna 
are: (1) feed mismatch including insect seal, (ii) parabolic dish, and (iii) 
radome. These items are individually treated below. 

Since the dual mode feed has a circular cross section and the polar- 
ization diplexer which will be used with this antenna was designed in 
square waveguide, a suitable transition was designed to connect the two. 
Such a transition requires the conversion of two dominant, orthogonal, 
TES modes in WS-42 to two dominant, orthogonal, TE, modes in WC-50. 
For this purpose, a four-inch linear tapered transition with no measur- 
able transmission loss and a return loss of better than 40 dB (reflection 
coefficient <0.01) was electroformed. Swept frequency return loss 
measurements on the feed with insect seal and linear transition were 
made, and are depicted in Fig. 15a. The return loss across the entire 
frequency range is better than 29 dB, corresponding to a VSWR <1.075 
or reflection coefficient <0.035. 

Return loss of the complete antenna was measured across the band 
of interest. Figure 15b shows this performance as a solid line for the 
antenna radiating into free space. The poorest return loss within the 
band is approximately 23 dB. A simple vector separation analysis, based 
on the assumption that the only contributions to the total returned power 
are the feed and paraboloid with radome, produced the dashed line as 
the contribution of the paraboloid and radome alone. Measurements 
indicate that with the mirror in its normal position, the contribution of 
the radome to total return loss is negligible since energy reflected back 
into the canister by the radome is not focused at the feed. This is still true 
with the antenna beam tilted down 3° since the reflected energy from 
the radome is 3° off boresight. 

The resulting dish contribution of approximately 28 dB agrees well 
with a computed value of 26.8 dB obtained from the equation 


4 
Return loss = 20 log ce (7) 


In this equation, f is the focal length of the paraboloid (0.536 meters), 
and G; is the feed gain. Feed gain has been computed as 12.9 dB at 18.7 
GHz by direct integration of the normalized feed radiation pattern. The 
equation above is derived by determining what fraction of power radiated 
by the feed, W;, is recaptured by the feed. The power density in the axial 
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Fig. 15—Return loss performance: (a) dual mode feed alone; (b) return loss with feed 
in canister antenna. 


region of the dish is Py = W;G;/4af?. This power density is reflected 
back toward the feed as a plane wave and captured with effective area 
Att = \*G;/4a. The power captured by the feed, W., is We = AerfPa = 
W,(AG;/4f)?, and eq. (7) follows directly. 


Vil. CONCLUSION AND SUMMARY OF ANTENNA CHARACTERISTICS 


The design and experimental optimization of a canister antenna are 
reviewed in this paper. The influence of antenna feed, parabolic reflector, 
radome, absorber and mirror on gain, radiation pattern, and return loss 
is considered. 

The measured gain of the antenna, virtually independent of polar- 
ization, is stated in Table I. These values of gain, measured without a 
radome in place, correspond to an approximate aperture efficiency of 
62 percent. The loss of the solid, half-wavelength-thick radome is 
0.4 dB. 

The principal and cross polarized response of the antenna is illustrated 
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Fig. 16—Principal (HH) and cross polarized (HV) smoothed azimuthal radiation pattern 
envelopes measured with radome installed: (a) 17.7 GHz; (b) 18.7 GHz; (c) 19.7 GHz. 


by the smoothed radiation patterns presented in Fig. 16. This figure 
presents the 17.7 GHz, 18.7 GHz and 19.7 GHz horizontally polarized 
response (HH) and the vertically polarized antenna response to a hori- 
zontally polarized transmitted signal (HV) with the half-wavelength 
radome on the antenna. Similar radiation patterns are obtained for the 
VV and VH polarizations. This is to be expected since the antenna design 
is essentially polarization independent provided the dual mode feed 
affords a balanced E and H plane illumination of the paraboloid. As 
noted earlier, such a balanced illumination is reasonably achieved. The 
principal polarization patterns include perturbation introduced by the 
radome. A comparison of these patterns with corresponding patterns 
measured without a radome reveals that sidelobe levels beneath —40 dB 
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are primarily affected, with the largest pattern deterioration occurring 
at the lowest radiation levels, and being most pronounced the further 
the frequency departs from 18.3 GHz (the “tuned” frequency for this 
radome). From Fig. 16, we also note that the on-axis cross polarization 
discrimination is in the mid to upper 30 dB range, confirming an estimate 
offered in an earlier paper dealing with this antenna concept.’ The in- 
fluence of mirror tilt and feed positioning is also assessed. It is shown 
that modest amounts of mirror tilt (e.g., +1.5°) and inaccurate feed 
positioning have minimal effect upon the antenna pattern and gain. 


CANISTER ANTENNA FOR 18-GHz OPERATION 631 


19.7 GHz 


—— PRINCIPAL 
POLARIZATION (HH) 


——-—CROSS 
POLARIZATION (HV) 


RELATIVE DIRECTIVITY, dB 





—60° —40° —20° 0 20° 40° 60° 


Fig. 16—(continued) 


Return loss measurements have been made on the complete antenna 
and individual components. The major contributors to reflected power 
are feed mismatch and the parabolic dish. The worst-case return loss 
of the feed with transition is approximately 29 dB. The return loss of the 
paraboloid is estimated to be 28 dB. With the mirror in its nominal po- 
sition, the radome contribution is negligible. The poorest total return 
loss measured (which occurs at a different frequency than the worst-case 
feed contribution) within the band is approximately 23 dB. 

As mentioned earlier, the antenna described in this paper was designed 
for an 18 GHz digital radio system. The particular antenna described 
herein was specifically designed to afford certain degrees of flexibility 
which would not be required after identification of those parameters 
which influence antenna performance. For example, the canister was 


632 THE BELL SYSTEM TECHNICAL JOURNAL, MARCH 1978 


oversized to permit installation of a variety of absorbing materials and 
so the paraboloid could be translated up and down relative to the mirror. 
With completion of the performance characterization, certain changes 
were made in the final antenna design. The final design will allow beam 
pointing of +8°. To accomplish this and assure adequate return loss, the 
radome will be fastened to the antenna in such a manner that the beam 
will not point within 1.5° of the normal to the radome surface. Perhaps 
the most significant change to be implemented in the final design is a 
larger paraboloidal reflector with a 0.864-meter diameter. This reflector 
(which necessitates a somewhat larger mirror) will afford approximately 
0.5 dB more gain. This increased gain is just slightly more than the 0.4 
dB loss introduced by the radome. It is expected that the antenna with 
a 0.864 meter dish will have a radiation pattern quite similar to that 
measured on the developmental model, and a total return loss of ap- 
-proximately 23 dB. 
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The ability to maintain linearly polarized output in single mode 
fibers is essential for utilization of polarization dependent receiver 
circuitry. Our measurements with long lengths of fiber (200 m) indicate 
that we can find input polarization angles which yield essentially 
linearly polarized output. However, we found that these polarization 
effects are greatly influenced by the presence of physical stress on the 
fiber such as stress due to bending, twisting, mounting, and other 
variations in ambient conditions. We conducted several experiments 
on short length fibers where special precautions were adopted to assure 
repeatability of the measurements. Our results indicate the existence 
of a general theoretical model that predicts the output polarization 
characteristics as a function of input polarization and fiber length. The 
model assumes the presence of two asynchronous, orthogonal modes, 
uniformly coupled over the entire fiber length. The model, however, 
cannot distinguish between uniformly coupled and uncoupled mode 
cases based on the output radiation measurements. 


1. INTRODUCTION 


The polarization characteristics of “single’’ mode optical fibers have 
been the subject of several previous publications.? An understanding 
of the polarization sensitivity of such fibers is important in assessing the 
applicability of polarization dependent optical circuitry.? An additional 
implication of polarization sensitivity is the introduction of delay dis- 
tortion in “single” mode fibers. 

In this paper, we present a theoretical model based on the propagation 
of two orthogonal modes; it is shown that based on the measurement of 
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the ellipticity of the output radiation alone, we cannot distinguish be- 
tween the following cases: 


(i) The existence of two orthogonal modes, uncoupled with different 
propagation constrants. 

(i) The existence of two orthogonal modes, with identical propagation 
constants and uniformly coupled by some means of periodic perturba- 
tion 

(iit) The most general case—two orthogonal modes, uniformly cou- 
pled, but with nonidentical propagation constants. 


The organization of the paper is as follows. In Section II, we summarize 
the theory, leaving the details to the Appendices. Section II describes 
the detailed experimental procedure utilized to measure the radiation 
ellipse as well as details of each of the measurements. In Section IV the 
experimental data is compared with simple theoretical results developed 
in Section II. 

It should be noted that the assumption of synchronous uncoupled 
modes will not verify our data. An additional finding of importance is 
that the most general case indicates that for a fiber of any given length, 
excitation conditions exist at the input that result in linearly polarized 
output. 


ll. THEORETICAL MODEL 


For completeness, we state the obvious: for synchronous uncoupled 
modes, the output radiation would be linearly polarized independent 
of the orientation of linear input polarization and fiber length. Experi- 
mentally, this is not observed and, hence, this model is ruled inappro- 
priate. 

As detailed in Appendices B and C, we assume spatially orthogonal 
modes whose propagation constants differ by A8; we further assume that 
the modes are uniformly coupled with a constant coupling x. The fiber 
output radiation ellipse,4 as described in Appendix A, would possess the 
following characteristics. If a and b are the amplitudes of the semi-major 
and minor axis components of the radiation ellipse, then their ratio is 


a 1+(1-— sin? 20’ sin? 2a]1/2 


R =5-—= 
b sin 20’ sin 2a (1) 
and the orientation y of the major axis of the ellipse is 
y = 1/2 tan! (tan 26’ cos 2a) — /2 (2) 


where & =6 + »/2 


636 THE BELL SYSTEM TECHNICAL JOURNAL, MARCH 1978 





Fig. 1—Photomicrograph of single mode fibers, core diameter = 4.7 y, outside diameter 
= HL. 


§ =input orientation of polarization with respect to x axis 
n =tan7! 2x/AB 

a =1/2VAB2 + 4x22 

z =fiber length 


When R = 0 or ~, the output is linearly polarized. From eq. (1), we see 
that this happens at 6’ = +mz7/2, independent of fiber length. Thus, this 
model predicts that even in the presence of either phase asynchronism 
or mode coupling (or both), there are specific orientations of input po- 
larization, 


6 = —n/2 + mx/2 (3) 


for which linear polarization is observed for all lengths. The measured 
relative phase shift 2a = V A6? + 4x? z, can be thought of as due to an 
effective AG, that includes the effect of coupling such that 


ABe = V AB? + 4x? (4) 


In our measurements, the reference angle 6’ = 0 was always selected at 
first by rotating the input polarization to that angle for which the output 
was linearly polarized; then based on the observations of the radiation 
ellipse at the output, we cannot distinguish between the following cases: 
(i) asynchronous, uncoupled modes, (ii) synchronous, coupled modes, 
and (it) asynchronous, coupled modes. These cases are summarized in 
Table I. 


Ill. EXPERIMENTAL PROCEDURE 


Figure 1 shows a photomicrograph of the fiber when illuminated with 
white light. The fibers used had a core of pure SiOz and a cladding of 
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Case 


Asynchronous 
(AB ¥ 0) 
Uncoupled 
(x = 0) 


Synchronous 
(AB = 0) 
Uniformly 
coupled 
(x # 0) 


Asynchronous 
AB #0 
Uniformly 
coupled 
(x ~ 0) 


2a 


ABz 


2Qxz 


ABez = VAB? + 4x2 z 


Table I—Summary of cases 
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Fig. 2—Electron micrograph of etched single mode fiber. 
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Fig. 3—Experimental set-up. 


B203-6SiO>». The core and the outer diameters were 4.7 um and 133 um, 
respectively. Due to diffusion of B203 in the MCVD fabrication process, 
the fiber core was unintentionally graded in refractive index and the An 
was lower than expected for these compositions.® Figure 2 shows an 
electron micrograph of the etched fiber.® The effective core-cladding 
index difference was An ~ 2 X 107°. Fibers of lengths usually less than 
3 meters were used in our short fiber length measurements. 

The experimental arrangement is shown in Fig. 3. Light from a po- 
larized He-Ne laser is passed through a polarization rotator to facilitate 
rotation of the input polarization at the input of the fiber. The light was 
chopped to provide a reference signal at the receiver and then focused 
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on the fiber by means of a lens. A small section of the fiber near the input 
end was immersed in glycerol to remove any undesired cladding mode 
that might have been excited. The output light was collimated by a lens 
and passed through a Glan-Thomson analyzer before being detected and 
displayed in a PAR receiver. A Babinet-Soleil compensator could be in- 
serted and removed, as needed, between the collimating lens and the 
analyzer. 

Experimentally, we found the polarization effects on long and short 
length fibers were greatly influenced by the presence of any stress on the 
fiber and other changes in ambient conditions. To ensure repeatability 
of the measurement, it was necessary to take many precautions. The 
fibers were held as straight as possible by gently taping them on to a flat 
surface, at regular but not necessarily identical intervals to minimize 
inducing any stress in the fiber. The input end of the fiber was held by 
using a vacuum chuck except in one experiment where it was held by a 
mechanical holder. In that case, it was clamped first using as little 
pressure as possible and then taped. In all our measurements, the input 
polarization angle, which resulted in linearly polarized output, served 
as the input reference angle, i.e., #’ = 0. The analyzer position oriented 
to measure the cross polarized component served as the reference axis 
for the orientation of the output ellipse. If we alter the angle 6’, the output _ 
in general will become elliptically polarized. In order to obtain the phase 
difference 6 between components parallel and perpendicular to the 
reference angle 6’ = 0, we have to orient the Babinet-Soleil compensator 
parallel or perpendicular to the original reference axis of the analyzer. 
With the plunger of the compensator adjusted to obtain linear polar- 
ization at the output, from the plunger position and the calibration of 
the compensator, the phase shift 2a can then be calculated. In each 
measurement, where input polarization is rotated with a given fixed 
length of the fiber, or where the length of the fiber is varied keeping the 
input excitation angle 0’ fixed, the output analyzer was oriented parallel 
to the major and minor axis of the output ellipse to obtain the power ratio 
of these components. Orientation of the minor axis was also determined 
from the angular position of the analyzer. 


IV. EXPERIMENTAL RESULTS 
4.1 Fixed fiber length, input polarization varied 


In this experiment, an input reference angle was found such that the 
output polarization was linearly polarized; we then measured the ellip- 
ticity of the output radiation as a function of input polarization rotation 
about this reference. Figures 4 and 5 compare the experimental data with 
the theoretical results obtained by computing 20 log;9 from eq. (1) and 
evaluating the orientation of eq. (2). The best fit was for 2a = 59°, which 
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Fig. 4—Measured and theoretical power output ratio, R, of the major to minor axis 
components of output radiation ellipse as a function of input polarization angle (6’). Fiber 
length L = 2.39 meters. 


was also verified from the compensator measurement. However, this 
experiment cannot give a specified value for the phase retardation per 
unit length, i.e., 6/L(=A8,), since we can not determine explicitly the 
integral number of 7/2 phase shifts included in the entire length of the 
fiber. 


4.2 Fixed input polarization, variable length 


4.2.1 On axis excitation (0’= 0) 


In this experiment, after finding the reference axis, orientation and 
power ratio R of the output ellipse was measured; the fiber was shortened 
at the input end repeatedly by a small amount (~ 5 mm) and the ex- 
periment was repeated by reorienting the input polarization to obtain 
a linearly polarized output. From Fig. 6, we infer, as the theory predicts, 
essentially a linearly polarized output independent of the fiber length. 
The experimental limit of 38 dB indicated in the figure is the limitation 
imposed by the degree of polarization of the laser source. Our mea- 
surements on long lengths (~200 m) indicate the cross polarized com- 
ponent was down 32 dB. However, the output polarization was subject 
to severe variations due to physical stress and other changes in ambient 
conditions. 

The data (not shown) indicate the input polarizer angle and the output 
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Fig. 5—Measured and estimated relative orientation of the ellipse y as a function of 
input polarization angle 6’ for L = 2.39 meters. 


analyzer angle remained essentially unchanged with deviations, within 
experimental errors, indicating negligible angular rotation of the fiber 
with length. 


4.2.2 Slightly off-axis excitation (0’~ 6° ) 


This experiment was conducted by offsetting the input polarization 
by an angle 6’ (~6°). The input fiber end was held mechanically; the 
output end was successively cut and the measurements were repeated. 
As expected the output was elliptically polarized; as a function of length, 
the polarization changes from elliptical to linear and then back to el- 
liptical. From eq. (3) for a small angular offset, i.e., for small 0’ 


IR| = — (5) 


sin 2a 





and from (4) 
1 
y= 5 tan” [20’ cos 2a] 
= § cos 2a (6) 
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Fig. 6—Essentially linear polarization is observed at the output when input end is 
shortened with on axis (#’ = 0) excitation. 


Figure 7 shows the excellent agreement between theory and experiment. 
In fitting the y data, it was necessary to include a “twist” term of the 
order of 0.16°/mm. This implied a periodicity of the twist to be about 
2.25 m, which is very close to the length of the fiber and was traced later 
to the holding and mounting arrangement at the input end of the fiber. 
When the fiber was allowed to lie flat and straight, and the input end was 
held using a vacuum chuck, the twist disappeared. 

From the compensator measurements after each cut, the average AG, 
was evaluated to be ~0.0581 radians per mm. As seen from Fig. 7, linear 
polarization occurs at about every 54 mm and from eq. (5), it is obvious 
that this occurs when 2a = 0, 7, ... etc. Therefore 2a = ASL = xz with the 
result AG, = 0.0582 radians per mm, in agreement with the compensator 
measurements. This leads to an estimated value of effective index dif- 
ference 


A rN 
_ Abe, 


Ane = — = 5.86 X 10-6 
20 aL 


between the modes. Thus, it seems reasonable to state that the fibers 
cannot be easily fabricated to such tolerances and, hence, the polarization 
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Fig. 7—Analytical predictions that R varies as (6’ sin 2a)~!, and y varies as (0’ cos 2a) 


as a function of length for small 0’, confirmed experimentally. Period between successive 
linear polarization is 54 mm yielding a AB = 0.0582 radians/mm. 


problems observed with these experimental fibers will qualitatively be 
observed for all nominally circular, single mode fibers. 


4.2.3 Excitation at 0’= 1/4 


If the fiber was excited at 45° from the reference angle &’ = 0, two 
modes of equal amplitudes would be excited; at the fiber output they 
would be out of phase by 2a = AG,L. Therefore in this case, the polar- 
ization will change, as a function of length, from circular to linear and 
then back to circular with elliptical polarization in between. For 6’ = 7/4, 
from eq. (1) and (2), 





|R| = 4s (9) 
tan a 
and 
y= 47/4 (10) 


In eq. (4), when cos 2a goes through zero, i.e., at a = 7/4 and |R| = 1 
(circular polarization), y changes sign. Experimental results compare 
very well with the analysis as shown in Fig. 8. 


V. CONCLUSIONS 


The fibers used in these experiments were nominally circularly sym- 
metric; however, the data on the elliptically polarized output radiation 
indicates that very small deviations from circular symmetry possibly 
exist and that these irregularities cause serious polarization effects. It 
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Fig. 8—Behavior of output polarization with decreasing input length. R varies as (tan 
a)~! and y = £45°. The beat period is the same as before—54 mm. 


appears unlikely that the fiber fabrication process can be improved to 
the extent that such effects can be completely eliminated. 

The results indicate that input excitation conditions do exist for which 
linearly polarized output will be observed for a fiber of any length. 
However, the output polarization is subject to severe variations due to 
physical stress and other changes in ambient conditions. Our experi- 
ments can not determine whether any mode coupling exists. However, 

‘if any mode coupling exists, it must be uniform, at least over the few 
meter lengths of the fiber measured. 

Polarization behavior of single mode fibers may hopefully be im- 
proved, by making the cores purposely elliptical, provided that any pe- 
riodic perturbation such as “twisting” of the core is sufficiently small. 
and does not result in strong mode coupling. Large ellipticity, however, 
will increase the modal dispersion. Further study is necessary to char- 
acterize the polarization behavior of elliptical core fibers and, if they hold 
polarization, to arrive at an optimum degree of ellipticity that would 
provide a balance between polarization control and dispersion effects. 
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APPENDIX A 


Orientation and Ratio of Major to Minor Axis of Vibrational Ellipse of 
Elliptically Polarized Waves 


Consider two time-varying (S.H.) orthogonal electric field compo- 
nents 
Ex = ayes (wtts1) (11) 


Ey = ages wtts2) (12) 


such that a? + a2 = 1. The resulting electric vector, as seen from Fig. 9, 
traces, in general, an ellipse. The difference in the effective propagation 
constants between the two waves is 
By —B2_ 4 
Ag = = — 13 

B L L (13) 
where L is the fiber length. Whenever a, or dg is zero or when 6 = mz, 
where m is an integer, the output wave is linearly polarized. Furthermore, 
when a; = do, and 6 = (2m + 1)z/2, the output is circularly polarized. 
The ratio R of the semi-major to semi-minor axis of the ellipse can be 
defined as? 


= ; = tan x (14) 


where x is an auxiliary angle (—7/4 < x < 2/4). By choosing an angle #(0 
< ¢ < x/2) such that 


tan¢ =—2 (15) 
ay 


the angle y of the orientation of the resultant ellipse, with respect to a 
reference axis ox, and the parameters ¢ and x are related by* 


sin 2y = sin 2¢ sin 6 (16) 
tan 2W = tan 2¢ cos 6 (17) 

In addition, 
ai +az=a?+ db? (18) 


By eliminating y from equations (14) and (16), and with the use of eq. 
(15), we find 





pa gt a lbvi = 4dajaj sin* 5 (19) 
b 2a 1a sin 6 
and from eas. (17) and (15), 
1 a 2a a9 
=, tan 7 == cos | (20) 
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Fig. 9—Output radiation ellipse. 


APPENDIX B 


Relations Between Input Polarization Angle, Fiber Parameters, and Output 
Ellipse 


We assume a cartesian co-ordinate system, as shown in Fig. 10, with 
ox and oy coincident with the axes of slightly elliptical core or with that 
of the birefringent axes of the core of a single mode fiber as the case may 
be. Let us also assume that a linearly polarized plane wave is incident 
upon the fiber at an angle @ with respect to ox; therefore the amplitudes 
of the excited modes with orthogonal polarization are cos 6 and sin 0 and 
represent the x and y component respectively. 

Assuming a uniformly distributed coupling characterized by a coupling 
constant «x between the modes whose propagation constant differ by Af, 
the complex field amplitudes are given by’® 


E,, = cos 6 cos a — J cos (n + 8) sina (21) 
Ey = sin @ cos a +j sin (y + 6) sina (22) 

where the parameter a, proportional to the length of the fiber, is 
(AB/2)2 + x2 z (23) 


and the parameter 7 which relate the degree of coupling and the asyn- 
chronism between modes is given by 


tan 7 = 2«/AB (24) 
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jo- n/2, 0=1/2 — n/2 
(LP) 


0'=n/4 
(1/tan a) 





Fig. 10—Input excitation conditions and the behavior of the resulting output polar- 
ization. 


For reasons indicated in the text, we exclude the case when both x and 
AB are equal to zero. For the case x = 0 and Af finite, 7 equals zero. If AB 
= 0 and x is finite, 7 equals 7/2. When both x and A are finite, 0 < y < 
a/2. 

By rewriting eqs. (21) and (22) similar to eqs. (11) and (12), we see 
that 


a? = cos? 6 cos? a + cos? (y + 8) sin? a (25) 
az = sin? 0 cos? a + sin? (y + 8) sin? a (26) 

and 
a? — a3 = cos 20 cos? a + cos 2(n + 8) sin? a (27) 


The phase constant 6 is determined by 


2 sin(2 6+ 
pipes Ne pian (28) 
sin 20 — sin 2(0 + y) tan? a 


Substituting (25) through (28) in (19) and (20) we find, after considerable 
trigonometric manipulation, that the ratio of the major to minor axis 
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of the resultant output ellipse is 
_ 1+ [1 — sin? (20 + ») sin? 2a)!” 


R 29 
sin (26 + n) sin 2a ey) 
and its orientation 
1 
y= 5 tan” [tan (20 + 7) cos 2a] — (30) 


By appropriate choice of the sign, we see that |R| in eq. (29) is bounded 
by 1 and ~ or 0 and 1. Whenever || goes to 0 or ©, we have linear po- 
larization. For a given length of the fiber z, therefore a, this occurs 
when 


n T 

6 gh om a9 Dep ae (31) 

As 7 varies from 0 to 7/2, the input angle @ at which the output is linearly 

polarized varies from 0 to —7/4. In the most general case, at this specific 

angle, phase difference 6, as seen from equation (28), equals either zero 

or multiples of z and is independent of a indicating that the two modes 

are either in or out of phase with each other at the output for all lengths. 

This clearly shows for a given fiber with a finite x and AQ, there is always 

an orientation of input polarization that would result in a linear polar- 
ization at the output. . 

Note that in the special case n = 0, 6 = 2a and therefore, 6 is dependent 
on length; however, at 0 = +mz/2, as seen from (25) and (26), one of the 
components goes to zero. Thus the condition for input excitation angle 
6 to achieve linear polarization at the output, for all values of a, is given 
by (31) and holds good for all values of 7 in the range 0 < |n| < 7/2. 


APPENDIX C 
General Theoretical Model Used for Observation of Output Ellipticity 


In this Appendix, we consider the general case illustrated in the pre- 
vious Appendix and show that two orthogonal uniformly coupled 
asynchronous modes can be resolved into a new set of orthogonal modes, 
but completely uncoupled. This implies, even if any coupling exists, as 
long as it is uniformly distributed, our results can be explained by as- 
suming uncoupled, asynchronous modes. Obviously, the measured AG 
will now include the effects of coupling. 

If no mode coupling exists between the modes, i1.e., x = 0, then from 
(23) and (24) 


Q=--¢2 (32) 
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and 


n = tan7 Ge) =0 (33) 

Under these conditions, from (21) and (22), 
E, = cos e-J« (34) 
E, = sin deti« (35) 


We have shown in the previous Appendix that in the most general case 
0 < 7/2 < 47/2, there exists an input polarization angle such that 6 = 
—n/2, for which the output polarization remains linear for all lengths. 
As shown in Fig. 10, if we rotate the coordinate system by —7/2, then the 
angle #’ in the new coordinate system is 


0 =0+ 7/2 (36) 


But substituting (36) in (21) and (22), and writing the amplitudes along 
Ox’ and Oy’ as 


_E). = E, cos n/2 —- Ey sin n/2 (37) 
and 
E,, = Ey sin 7/2 + Ey, cos n/2 (38) 
Using eqs. (34) through (38) we can easily show that 
E,, = cos #’e-i« (39) 
E, = sin #/etse (40) 


Kas. (39) and (40) are identical to (84) and (35) with 0 being replaced by 
#’. Thus the coupled orthogonal modes can be easily resolved in terms 
of uncoupled orthogonal modes. Obviously, although a remains the same 
as in (23), it can be defined as in (32) to include the effects of the nonzero 


coupling as 
2 
a= Be, = 4/ (2) + x22 (41) 


By substituting (36) in (29) we find the ratio of major to minor axis of 
output ellipse is now given by 


_ 1+ (1 — sin? 20’ sin? 2a)? 
sin 20’ sin 2a 


R (42) 


Therefore excitation at the input polarization reference angle (#’ = 0) 
for which R — 0 or ©, the output remains linearly polarized for all 
lengths. Then if we vary 6’ at the input, eq. (42) represents the ratio of 
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axes of output ellipse. The orientation of the output ellipse is now given 


by 


1 
y= 5 tan [tan 20’ cos 2a] — n/2 (43) 
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Pair to Distributed Interference 
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We show that the longitudinal circuit defined by a well-balanced 
wire pair ina cable can be studied independently of the metallic circuit. 
The metallic circuit is then excited by the longitudinal voltage and 
current acting through the wire pair and terminal unbalances. Discrete 
parameter longitudinal circuits are defined which have the same ter- 
minal response as the distributed-parameter longitudinal circuit. These 
equivalent circuits are studied under an electrically short assumption, 
yielding simple expressions for their terminal response. The electrically 
short assumption enables the distributed impressed voltage which 
excites the longitudinal circuit to be represented by only two param- 
eters, the “total impressed voltage” and the “center of impressed 
voltage.” These parameters are analogous to the total mass and center 
of mass of a thin filament or wire. Finally, an analysis of a longitudinal 
circuit defined by a subscriber loop excited by a nearby power distri- 
bution system is used to derive a relationship between the short-circuit 
longitudinal current at the central office and the open-circuit longi- 
tudinal voltage at the telephone set. This relationship is used to esti- 
mate the distribution of short-circuit longitudinal current at the central 
office from a known distribution of open-circuit longitudinal voltage 
at the telephone set. 


I. INTRODUCTION 


The problem of computing the steady-state response of a multicon- 
ductor system has received considerable attention in the literature. 
Carson and Hoyt! developed the classical transmission line equations 
and S. O. Rice? developed the mathematical techniques necessary for 
their solution. Even so, the complexities introduced by a large number 
of conductors tend to limit the amount of basic understanding of fun- 
damental problems, such as the effects of longitudinal induction and 
longitudinal-to-metallic conversion, that can be obtained by pursuing 
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the multiconductor problem. Moreover, a large number of conductors 
are necessarily described by a large number of parameters upon which 
there is often a paucity of data. 

We present an in-depth analysis of the steady-state response of a 
well-balanced wire pair to distributed interference. This simplification 
of the general problem enables the analysis to continue beyond the for- 
mal solution to develop both an intuitive feel for the problem and a 
simple model for use in engineering applications. The effect of other pairs 
can be approximated in the one-pair model by a judicious choice of model 
parameters. . 

Historically, the primary emphasis has been on characterizing the 
longitudinal and metallic voltage at the subscriber’s telephone set be- 
cause of the impact of these voltages on the quality of the communica- 
tion’s path.©’ More recently, the use of electronic loop terminating 
equipment has generated interest in the longitudinal current at the 
central office. This study was motivated by the need to characterize the 
longitudinal current at the central office and to better understand the 
roles that the terminal and wire pair unbalances play in longitudinal- 
to-metallic conversion. 


ll. SUMMARY OF RESULTS 


We begin with the classical transmission line equations which define 
the steady-state response of the wire pair when excited by a distributed 
impressed voltage. A transformation to the longitudinal and metallic 
voltages and currents is employed to study the longitudinal and metallic 
circuits defined by the wire pair. We show that if the wire pair and its 
terminations are “well-balanced,” then the longitudinal circuit (LC) can 
be studied independently of the metallic circuit (MC). The MC is then 
excited by the longitudinal voltage and current acting through the wire 
pair and terminal unbalances. The unbalances admit the following in- 
terpretations: (7) longitudinal current flowing through the distributed 
impedance unbalance of the wire pair can be represented as a distributed 
series voltage generator in the MC, (11) longitudinal voltage across the 
distributed admittance unbalance of the wire pair can be represented 
as a distributed shunt current generator in the MC, and (ii) longitudinal 
current through a discrete impedance unbalance in a termination can 
be represented as a discrete voltage generator in the termination for the 
MC. 

The response of the LC is studied in some detail. Discrete parameter 
circuits are defined which have the same terminal response as the dis- 
tributed parameter LC. These equivalent circuits are studied under an 
electrically short assumption, yielding simple expressions for their ter- 
minal response. The electrically short assumption enables the distributed 
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Fig. 1—Incremental circuit model. 


impressed voltage to be represented by only two parameters, the total 
impressed voltage and the center of impressed voltage. These parameters 
are analogous to the total mass and center of mass of thin filament or 
wire. Finally, an analysis of the LC defined by a subscriber loop excited 
by a nearby power distribution system is used to derive a relationship 
between the short-circuit longitudinal current at the central office and 
the open-circuit longitudinal voltage at the telephone set. This rela- 
tionship is used to estimate the distribution of short-circuit longitudinal 
current at the central office from the known distribution of open-circuit 
longitudinal voltage at the telephone set. 


It. BASIC EQUATIONS 
3.1 Transmission line equations 


The transmission line equations for a wire pair in a cable excited by 
a distributed impressed voltage acting as a fixed radian frequency are 


vy(x) = —24(x)is(x) — zigie(x) + €1(x) 
iy(x) = —[yi(x) + yrelui(x) + yisvalx) (1) 
V(x) = —Zo(x)io(x) — z1211(x) + €o(x) 
ix(x) = —[yo(x) + yig]va(x) + yiev1(x). (2) 


Boundary conditions, discussed in Section 3.2, are determined from 
terminations at the ends (x = 0 and @) of the wire pair. The above 
equations are essentially Rice’s? eqs. (1.1) and (1.2) except for our 
notation which is motivated by the incremental circuit model of Fig. 1. 
Notice that three types of unbalances are possible at x; an impedance 
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unbalance [z3(x) # zo(x)], an admittance unbalance [y1(x) # yo(x)] and 
an unbalance in the impressed voltage [€1(x) ¥ €o(x)]. 

The metallic and longitudinal voltages and currents are defined as 
follows: 


v(x) = vy(x) — ve(x) = metallic voltage 


i(x) = ; [i,(x) — ig(x)] = metallic current (3) 


1 
Ug(x) = 5 [v1(x) + ve(x)] = longitudinal voltage 


lg (x) = 14(x) + to(x) = longitudinal current. (4) 


Transforming the transmission line equations [eqs. (1) and (2)] to the 
metallic and longitudinal voltages and currents yields 


v’(x) = —2i(x) + 62 (x)ig(x) + 6.(x) 


t’(x) = —yu(x) + by (x)vg (x) (5) 
v(x) = —Zglg(x) + 6,(x)i(x) + eg (x) 
ig(x) = —ygdg(x) + dy(x)v(x). (6) 


The parameters of eqs. (5) and (6) are defined in terms of the parameters 
of eqs. (1) and (2) and Fig. 1 as follows: The impedances in ohms per unit 
length are 


Z = 24(x) + 29(x) — 2z42 


Zg= jee) + z9(x) + 2219] 


52 = ; [zo(x) — 21(x)). (7) 
The admittances in mhos per unit length are 
a= ; [yi(x) + yo(x)] + vie 
Vg = yilx) + yolx) 
Syl) = 5 yale) — yal] (8) 
The impressed voltages in volts per unit length are 
ég(x) = slate) + e9(x)] 


b(x) = e1(x) — €o(x). (9) 
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x=0,2 
Fig. 2—Termination circuit model. 


Only the unbalances, which now appear explicitly, and the impressed 
voltages remain x-dependent; all other parameters are assumed uni- 
form. 


3.2 Boundary conditions 


The boundary conditions necessary to determine a particular solution 
to eqs. (1) and (2), or equivalently eqs. (5) and (6), are determined from 
the terminations at the ends of the wire pair. Consider the canonical 
passive-symmetric termination of Fig. 2. The boundary conditions for 
the voltages and currents relative to ground are 


V1 = —$1l1 — Sale 
Vg = —Sol2 — S911. (10) 


Transforming to the metallic and longitudinal voltages and currents [eqs. 
(3) and (4)] yields 


v=—fit Agz (11) 


The parameters of eqs. (11) and (12) are defined in terms of the pa- 
rameters of eq. (10) and Fig. 2 as follows: 


=o + fo + 2h 
1 
bg = ras + {9 — 2612) 


1 
Ay = 9 (2 — &). (13) 
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Fig. 3—Longitudinal circuit model. 


3.3 Longitudinal and metallic circuits 


The metallic and longitudinal voltages and currents supported by a 
wire pair are determined by eqs. (5) and (6) together with boundary 
conditions of the type given in eqs. (11) and (12) at the ends of the wire 
pair. Notice that if the wire pair and its terminations are perfectly bal- 
anced, then the metallic voltage and current are identically zero. Hence 
it follows by continuity? that if the wire pair and its terminations are 
well-balanced (i.e., the unbalances are small relative to their associated 
longitudinal parameters*), then the metallic voltage and current are 
small relative to the longitudinal voltage and ‘current. 

Under a well-balanced assumption, the second-order terms in eqs. (6) 
and (12) can be neglected, leaving 


V(x) = —Zgig(x) + eg(x) 
ig(x) = —ygbg(x) (14) 

with boundary conditions of the form . 
Vg = —Sglg. (15) 


Notice that the longitudinal voltage and current can be assumed inde- 
pendent of both the metallic voltage and current and the system unba- 
lances. Moreover, the circuit models of Fig. 3 which represent the above 
equations define the LC. 

Now assume that v,(x) and i,(x) are known in eqs. (5) and (11). Then 
the metallic voltage and current satisfy 


v’/(x) = —zi(x) + €(x) 
i’(x) = —yv(x) + &(x) (16) 


with boundary conditions of the form 


* Cable pairs have an average resistance unbalance of 2 percent and an average capac- 
itance unbalance to ground of 0.5 percent. 
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+ + 
Ay 
" yAx tAx = 
ig 
_ : a 
x xt+Ax 
x=0, 2 
Fig. 4—Metallic circuit model. 
where 
e(x) = 62 (x)ig(x) + 5,.(x) 
E(x) = 6, (x)vg(x). (18) 


are all known. The circuit models in Fig. 4 represent the above equations 
and define the MC. Note that the x-dependence of the wire pair unba- 
lances pose no analytical problems since they appear in the forcing 
functions «(x) and £(x). Moreover, superposition can be applied to yield 
a decomposition of the metallic voltage and current as a sum of terms 
due to each unbalance acting separately. 

Let us summarize our results. A well-balanced wire pair with well- 
balanced terminations defines a longitudinal and a metallic circuit. The 
LC can be assumed independent of the MC. The MC is excited by the 
longitudinal voltage and current acting through the wire pair and ter- 
minal unbalances which can be interpreted as follows: longitudinal 
current i,(x) following through the distributed impedance unbalance 
of the wire pair 6, (x) can be represented as a distributed series voltage 
generator 6,(x)ig(x) in the MC, longitudinal voltage u,(x) across the 
distributed admittance unbalance of the wire pair 6,(x) can be repre- 
sented as a distributed shunt current generator 6,(x)ug(x) in the MC, 
and longitudinal current i, flowing through the discrete impedance 
unbalance 6; in a termination can be represented as a discrete voltage 
generator Ad, in the termination for the MC. Finally, superposition can 
be applied to yield a decomposition of the metallic voltage and current 
as asum of terms due to each unbalance acting separately. The equations 
describing the LC and MC, since they have constant coefficients, are 
amenable to standard techniques® which are applied to analyze the LC 
in the next section. 
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IV. LONGITUDINAL CIRCUIT ANALYSIS 
4.1 Discrete-parameter equivalent circuits 


Equation (14) describing the response of the LC defined by a well- 
balanced wire pair is conveniently represented as a forced linear sys- 


tem, 
eye [ ee} os 
ig(x) yg OF Ligx) 0 
The boundary conditions defined by the longitudinal impedances ¢ o and 


¢ 4 of the well-balanced terminations at the ends (x = 0 and @) of the wire 
pair are, from eq. (15), 


Ug (0) = — ¢9,(0) 
vg(l) = Mig (2). (20) 
The solution of eq. (19) is of the form 


[re xa [2] + (ae-o [2] ae en 


Since the LC is assumed uniform (i.e., zg and yg, are independent of x), 
the transition matrix of the LC, &,(&), can be expressed in terms of the 
characteristic impedance kg = Vz,/yg and propagation constant yg = 
V Zgyz of the LC, 


cosh yzgé —kg sinh ygé 


®,(E) = (22) 


; cosh 7g 
—— sinh yz 
kg 


Substituting the boundary conditions [eq. (20)] and the above expression 
for ®,(£) in eq. (21) yields after some manipulation 


Ke cosh yg£+kgsinh yg ¢f BA 


i. ég(é) cosh yg(@ — &)dé 
So 


(23) 
£ 
ég(&) sinh yg(@ — &)dé 
The formal solution of eq. (23) for i,(0) and ig(@) is of the form 
E 
ig = ——_2— (24) 
aan Ae 
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x=0 X=R 


Fig. 5—Discrete-parameter longitudinal circuit models. 


where KE, is the longitudinal source and Z, is the longitudinal impedance 
seen looking into the LC from one end, terminated in ¢, at the other end. 
These quantities define two discrete parameter circuits (see Fig. 5) which 
have the same terminal response as the distributed parameter circuit 
of Fig. 3. 

General expressions for E, and Z, at each end of the LC are given 
below: 


e 4 
f, eg(é) | cosh VAC Se 7 sinh y,(£ — | dé 
Bo => —________“#_______- (28) 
cosh ygé + 7 sinh yg@ 
& 


© Cs 
f, €g (é) | cosh Ygi + sinh vat | dé 
0 Rg 


Ef= (26) 


0 
cosh yg + 7 sinh y,@ 
& 


e 
Zo = Sq + kg tanh yg (27) 


e 
1+-* tanh vy, 
Rg 


0 
_ §a+k, tanh ygl 
Zi = io ; (28) 
1+- tanh Vet 
Rg 
These expressions simplify if the LC is terminated in its characteristic 


impedance, ky. For example, setting ¢f = kg in eqs. (25) and (27) 
yields 


14 
EQ= fo eg(Gemretde 
Z) = hy. (29) 
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Note that the voltage impressed furthest from x = 0 suffers the most 
attenuation as one would intuitively expect. 


4.2 The electrically short longitudinal circuit 


An LC is electrically short if its electrical length, | yz4|, is small. This 
is typically the case when the source of impressed voltage is a nearby 
power distribution system. If |y,@| is small, then the hyperbolic func- 
tions can be approximated by the first terms of their power series ex- 
pansions; sinh ygf ~ ygf, cosh ygé ~ 1, and tanh ygf ~ yzf. These 
approximations when substituted into eqs. (25)—(28) yield approxima- 
tions of the longitudinal source and longitudinal impedance, 


2 1+ ¢fy,(@ — 
p= pitinet—2 (30) 
1+ Soyg8 
gs 
Ee = pra tevel (31) 
1+ Gy gl 
e 
2° = Sat 2k (32) 
1+ Cyl 
0 
Ze = Set zyl (33) 
1+ Sy 


The quantities E and @ are defined as the total impressed voltage, 
é 
B= [° g(ede (34) 


and the center of impressed voltage, 


a 1 £L 
Bas { geq(b)d8. (35) 


These definitions have a physical intrepretation if the impressed 
voltage is a real valued and nonnegative function. In this case, F = 0 and 
0 < 2 < £and the center of impressed voltage can be interpreted as that 
point along the LC where the impressed voltage may be concentrated 
without changing the terminal response of the Lc. Hence a particular 
E and @ define a class of equivalent impressed voltages whose canonical 
representative is a point source of strength E based at 2. These concepts 
have their mechanical analogs in the total mass and the center of mass 
of a thin filament or wire. Recall that the center of mass is that point 
along the wire where the total mass may be concentrated while preserving 
moments about the ends of the wire. In our case, the center of impressed 
voltage is that point along the LC where the impressed voltage may be 
concentrated while preserving the response at the ends of the LC. 
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V. ENGINEERING APPLICATION 
5.1 The longitudinal response of an electrically short subscriber loop 


The most common LC is a subscriber loop excited by a nearby power 
distribution system. This circuit is typically electrically short at the 
fundamental frequency of the impressed voltage (60 Hz). Longitudinally, 
a subscriber loop has a low impedance termination at the central office 
by virtue of the battery supply circuit, and essentially an open-circuit 
termination at the telephone set, assuming single-party or isolated ringer 
service. Consequently, the longitudinal quantities of primary interest 
are the short-circuit current at the central office, J,, and the open-circuit 
voltage at the telephone set, Vy. 

We now derive a simple relationship between these two fundamental 
quantities. To begin, J, can be expressed as 


f° 
Ig = 5G 36 
where BE? and Z ? are given for an arbitrary ¢ in eqs. (30) and (32). Letting 
e —> in these equations yields 


ap ===) p: a (37) 
2 = we (38) 
Substitution of these results into eq. (36) yields 
Iz = Eyg(€ — @). (39) 
Similarly, V, can be expressed as 
V,=Ef (40) 
where E f is given in eq. (31) for an arbitrary (?. Setting @ = 0 yields 
V,=E. | (41) 
Hence I, and V, satisfy 
Ig = Vayg(l — 2). (42) 


The link between V, and J, is the center of exposure, @. 


5.2 Estimating V, 


Historically, emphasis has been placed on characterizing the open- 
circuit longitudinal voltage at the telephone set because it can be con- 
verted to an interfering metallic voltage (V) across the telephone set if 
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unbalances are present in the loop.” The historical measure of loop 
balance® is 


BAL = 20 log “| dB. (43) 





The balance of the loop determines the amount of longitudinal voltage 
that will be converted to metallic voltage by the loop unbalances. This 
can be expressed mathematically as 


The metallic circuit noise (N) and the longitudinal circuit noise (Nz, 
usually called noise to ground) are defined as 





N = 20 log 7 dBrn (45) 
Vr 
Veg 

Ng = 20 log dBrn. (46) 
Vr 





The reference noise (dBrn) voltage is 24.5 1 V which corresponds to 1 pW 
across a 600 2 resistor. The commonly used 3-type noise measuring set 
attenuates a longitudinal noise measurement by 40 dB. Hence 40 dB 
must be added to a measured value to obtain Nz in dBrn. 

The distributions of noise and loop balance for Bell System loops were 
determined in 1964 as part of the General Loop Survey of physical and 
transmission characteristics of the loop plant.4 Noise measurements were 
made on 1100 randomly selected loops during normal working hours at 
the subscriber’s telephone set using both 3-kHz flat and C-message 
frequency weighting. C-message weighting approximates the frequency 
response of the telephone set and the human ear, and 3-kHz flat- 
weighting assigns equal weight to all frequencies in the 0 to 3 kHz band. 
Since the primary interferer on subscriber loops is a nearby power dis- 
tribution system with a fundamental frequency of 60 Hz, a 3-kHz flat 
measurement is dominated by the 60-Hz component. Hence the rms 
voltage corresponding to N, measured with 3-kHz flat weighting is es- 
sentially 60 Hz. The distribution of this voltage over the 1100 loops is 
shown in Fig. 6. The maximum voltage was 18 V. Ninety-nine percent 
of the loops had a measured voltage of less than 11 V. The average voltage 
was 1.5 V with a standard deviation of 2.1 V. 


5.3 Estimating I, 


The range of longitudinal current at the central office affects both the 
operation of existing loop terminating equipment and the design of new 
equipment. In this section, we estimate the distribution of longitudinal 
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OPEN—CIRCUIT LONGITUDINAL VOLTAGE 
(V rms) 


Fig. 6—Distribution of open-circuit rms longitudinal voltage at the telephone set (1964 
General Loop Survey). 


current using the distribution of longitudinal voltages and loop lengths 
determined as part of the 1964 General Loop Survey. 

The basic equation [eq. (42)| relating J, and V, can be expressed in 
the form 


[Ie] = |Vellye|@a — 2/2). (47) 


The ratio 2/@ is a measure of where the exposure is centered along the 
loop. If the exposure is centered at the subscriber (2/2 = 1), then the 
short-circuit current at the central office is zero. Conversely, the current 
is maximum if the exposure is centered over the central office (2/2 = 0). 
In this case, | 


max|Ig| = |Vg||ygl 4. 


Equation (47) can be used to estimate the distribution of |J,| from 
_the distributions of | V,| (see Fig. 6) and @ (see Fig. 2 of Ref. 4) and an 
assumed value of 2/2. The admittance of the longitudinal circuit is pri- 
marily capacitive at 0.17 uF per mi. Distributions of |I,| at 60 Hz for 2/€ 
= 0, 0.5, and 0.9 are shown in Fig. 7. In the worst-case condition of @/€ 
= 0, the maximum current is 12 mA. Ninety-nine percent of all loops had 
a current of less than 4 mA. The average current was 0.3 mA with a 
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Fig. 7—Estimated distributions of short-circuit 60-Hz longitudinal current at the central 
office (1964 General Loop Survey). 


standard deviation of 0.9 mA. If the exposures are centered at the mid- 
points of the loops (2/@ = 0.5), then the above currents are reduced by 
a factor of 2. If the exposures are centered near the subscriber (@/é = 0.9), 
then the above currents are reduced by a factor of 10, i.e., the maximum 


current reduces to 1.2 mA and the average current reduces to 0.03 
mA. 


5.4 Estimating €/2 


The estimated distribution of the short-circuit longitudinal current 
at the central office is very sensitive to where the exposures are centered 
along the loops. An estimate of 2/@ can be made from simultaneous 
measurements of |J,| and | V,| using another form of eq. (47), 


= 1 
7 (48) 
IVellyel? 


A limited number of near-simultaneous measurements were made as 
part of the CO Strata Survey described in Ref. 5. Single near-simulta- 
neous measurements of |I,| and | V,| at 60 Hz were made during normal 
working hours for each of the 47 test loops. These data were not discussed 
in Ref. 5 but were made available to us by D. N. Heirman. The ratio @/£ 
was calculated for each test loop using eq. (48). The distribution of the 
calculated values over the 47 loops is shown in Fig. 8. All exposures were 
centered beyond the midpoints of the loops. The average value of @/£ 
was 0.88. Hence the exposure of the average loop was centered at almost 
90 percent of the loop length. These results are intuitively pleasing since 
the worst exposures (long aerial parallels with single-phase) are indeed 
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Fig. 8—Distribution of @/é at 60 Hz (Central Office Strata Survey). 


over the far ends of the loops. If this distribution of. £/£ is representative, 
then the worst case condition is not 2/2 = 0 but 2/@ = 0.5. In this case 
(see Fig. 7), the maximum current at 60 Hz. is 6 mA, 99 percent of all 
loops had a current of less than 2 mA and the average current was 0.15 
mA. 


5.5 Use of the data 


The data presented in this section are based on one-time measure- 
ments made during normal working hours as part of the 1964 General 
Loop Survey. The demand for commercial power has doubled since 1964, 
and one would expect the induced voltages to have increased as a result. 
In addition, recent surveys® have shown that time-of-day variation in 
60-Hz induction is likely to be 2 to 1. In residential areas in particular, 
the induction is likely to be higher in the evening than during the day 
when the 1964 measurements were made. For these reasons, the reader 
is cautioned to use the data presented in this section with care. 
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An Inversion Technique for the Laplace 
Transform with Application to Approximation 
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Properties of a sequence of positive operators defined by the Widder 
Laplace inversion formula are studied in order to obtain practical 
methods for the inversion of the Laplace transform, practical error 
formulae, and useful approximations to given functions. The approx- 
imation procedure retains essential structural characteristics of the 
original function, e.g., nonnegativity, monotonicity, and convexity. 
Thus a distribution function is approximated by distribution functions. 
Enhancement techniques are provided for the improvement of accuracy 
_ for a given order of approximation. The methods are illustrated by 
applications to renewal theory and to the covariance and recovery 
functions of telephone traffic theory. 


Il. INTRODUCTION 


The Laplace transform occurs frequently in investigations of queueing 
theory and telephone traffic models in which it usually represents a 
probability distribution function. Although the mean and variance of 
the distribution can be readily obtained from the transform, there are 
many investigations in which the distribution itself is needed; in par- 
ticular, good analytic and numerical approximations for the comple- 
mentary distribution when the argument is large. This is the case, for 
example, when studying waiting times of queues, time delays of work 
through a computer system, and delays of message progress through data 
networks. 

Numerical methods which have been made thus far!®-!8 concentrate 
on accurate numerical approximation on some interval [0, T], the diffi- 
culty of accurate inversion increasing with increasing T. Methods de- 
pending on Gauss-Legendre quadrature applied to the defining Laplace 
integral with subsequent interpolation are discussed in Ref. 19. These 
methods require the solution of large order linear systems whose matrices 
are severely ill-conditioned; thus they can bog down in meaningless 
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calculations. Much ingenuity has been used in specific cases to circum- 
vent this problem. Asymptotic formulae may sometimes be used to ap- 
proximate the complementary distribution for large argument; however, 
in many practical cases, good accuracy was obtained only when the 
argument was so large that the corresponding probabilities were too small 
to be of practical significance. One of the methods of this paper, namely, 
the a enhancement procedure, specifically attacks this problem by im- 
itating the exponential decay of the original in [T, | while simulta- 
neously providing accurate approximation in [0, 7]. The transition region 
is sufficiently well approximated for most practical uses. 

The well-known Laplace inversion formula of Widder!? has not been 
actively used in practical work. It has been the experience of the author 
that an investigation of the Widder formula qua functional transfor- 
mation can provide useful practical techniques for inversion and also 
inequalities and limit relations between the approximations and the 
original function. Accordingly, it is the object of this paper to study the 
properties of a sequence of positive operators defined by the Widder 
formula in order to obtain practical methods for Laplace inversion, 
practical error formulae, and useful approximations to given functions. 

In II the Widder inversion formula is obtained and a sequence of 
positive operators, L,,, which form the subject of the paper, are intro- 
duced. The L, map a function f(t)(¢ = 0) to a sequence of functions f,, (¢) 
= L,f which converge uniformly on [0, ~] to f(t). This viewpoint enables 
one to study the approximation characteristics of the sequence f, (¢), thus 
providing a means of approximating a given f(t) besides effecting the 
inversion of its Laplace transform, f(s). Several representations are given 
for f,(t) in terms of f(t). 

In III properties of the sequence {f,, (t)} are developed which show that 
it possesses many desirable characteristics. In many applications it is 
preferable that the approximating functions globally imitate the original 
function in qualitative structural features rather than to the attainment 
of very high numerical accuracy. Thus if the original function lies be- 
tween zero and one, is monotone decreasing, and is convex, then these . 
same properties would be desired in the approximation. It is shown that 
the approximating sequence, {f,, (t)}, does retain those properties. A re- 
cursion relation for f,,+1(t) in terms of f,,(¢), and a generating function 
for the sequence are also given, thus making the computation of higher 
approximations possibly more convenient than the direct application 
of the representation formulae themselves. A useful feature of the f, (t) 
is that, when f(t) is convex, they satisfy f,(t) = f(t). 

Part IV develops error bounds and pointwise error estimates. The 
results in terms of f(t) reflect the use of the technique for approximation; 
on the other hand, the pointwise estimate of error in terms of f, (t) is 
especially useful for the inversion problem since then f(t) is not available. 
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It is also shown that the successive approximations fo(t), f1(t), fo(t),... 
are uniformly better for each t if f(t) = 0. 

In practical use the initial member of the sequence, fo(t), is not an 
accurate approximation to f(t) for ¢ not in the neighborhoods of zero and 
infinity. Additionally the sequence {f,, (t)} does not converge rapidly in 
n. Consequently one must go far out in the sequence to obtain adequate 
accuracy. Part V treats this problem. A modification, fn,.(t), of fn(t) is 
introduced depending on a parameter a for which, by appropriate choice 
of a, fo,.(t) is a much improved approximation to f(t) than f(t); the 
rapidity of convergence of the sequence {f, ,, (t)} is not improved over that 
of the unmodified sequence. However, it has been found that good ac- 
curacy is obtained by use of fo,4(¢) or f1,.(t) as is demonstrated in the 
examples on covariance and recovery functions given in this paper. 

In many applications, especially to complementary distribution 
functions, the behavior of f(t) for large t must be accurately reproduced. 
The approximations f,;, .(t) accomplish this especially when a is related 
to the decrement of an exponential majorant. For functions which are 
exponentially small at infinity, the f,,(t) do not adequately reproduce 
the decay of f(t). 

Many of the desirable features of the original method are still retained 
by this modification. The concept of convexity with exponent a is in- 
troduced which allows the transference of the inequality /,(t) = f(t) to 
fn,a(t) 2 f(t). A criterion is given for deciding convexity with exponent 
a in terms of the transform, f(s). 

The degree of precision concept is applied to the approximation se- 
quence in order to obtain a modified sequence, s,,(t), which converges 
more rapidly. For sufficiently smooth functions this method is successful. 
The approximation s,,(t) consists of a linear combination of fo(t),.. . , 
fn(t) or of fo,a(t),---, fn,a(t) and hence is easily applied. Its efficacy is 
demonstrated in the examples of this paper. Unfortunately the im- 
provement in rapidity of convergence is so strong that the map from f(t) 
to s,(t) is no longer positive, consequently many of the desirable struc- 
tural preservation properties of the L,, are lost in favor of greater nu- 
merical accuracy. 

An attempt is made to enhance the rapidity of convergence of {f,(t)} 
while simultaneously retaining the positivity of the map. This is ac- 
complished by the construction of a new sequence, h,,(t), which is also 
a linear combination of fo(t),... , f(t). As is to be expected, however, 
the improvement is not as great as is realized with the sequence s,,(t). 

The pointwise error estimate developed in Part IV may be used as a 
correction device on f,, (¢) or fn,~(t) to improve further the accuracy of 
computation. This, however, in the absence of an error estimate for the 
modification, must rely on one’s understanding of the specific problem 
for ascertaining the reasonableness of the result. 
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An application of the methods of this paper to the renewal function? 
in the theory of renewal processes is made in Part VI. The remarkable 
accuracy of the simplest of the approximations fo(t), f1(t) is notewor- 
thy. 

Part VII presents applications of the techniques to the covariance and 
recovery functions of Erlang blocking models used in telephone traffic 
theory.!! For the covariance function, the initial approximation, fo,.(t), 
is excellent; however, in the case of the recovery function it was found 
that fo,a(t), f1,¢(¢) might not be considered sufficiently accurate, ac- 
cordingly the linear combination, s;(t), was used. This provided suffi- 
cient enhancement of accuracy. 

The generating function, G(z,t), for the f, (t) can sometimes be used 
to obtain an explicit construction of the sequence. Some examples of this 
nature are treated in Part VIII. 

Applications of the methods herein have been made to the comple- 
mentary distributions of waiting time in M/G/1 queues. Also B. W. Stuck 
and EK. Arthurs have successfully applied these techniques to the study 
of models of computer systems. 

There are questions of an exclusively mathematical character which 
have not been touched upon, e.g., a semigroup interpretation and satu- 
ration phenomena. It is felt that these would be outside the essentially 
practical thrust of the paper. For some theorems which are applicable 
to the operators of this paper see Ref. 5. 

A short table of operations on f(t) and their corresponding maps under 
Ly is included to facilitate application of these methods to the con- 
struction of approximations. 


ll. WIDDER INVERSION—REPRESENTATIONS 


Let the transform f(s), 
fls)= { e-flu)du (1) 
exist for s > 0, then 
(CD on+izin)(s) is f- e-suunt(u)du (2) 
n! n! Jo 


in which 
Fins) = Fis), (3) 
Ss 


The function (s"*1/n!)e—s4“u" is a probability density function on (0, 
o) for s > 0,n = 0 whose mean is (n + 1)/s and variance (n + 1)/s2. When 
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s = (n + 1)/t, the mean and variance are t and t?/(n + 1) respectively. 
One has:! 


Theorem 1 (Widder). Let the transform, /(s), of f(t) exist for s > 0, let 
f(t) be continuous at t and bounded on [0, ~], then 


ji cm grt fln)(s) = f(t). 


nm—> oo n. s=(n+1)/t 





The convergence is uniform in every finite closed interval throughout 
which f(t) is continuous. 


Proof. Korovkin’s theorem on sequences of positive functionals.” 


The inversion theorem, in the above form, had already been stated 
by Feller? who used the law of large numbers to effect the proof. It is the 
purpose of this paper to study the transformation 

— 1)" ~ 
Laf = fn = ont Hft(5) 
n 


(4) 





s=(n+1)/t 


so that f, may be effectively used as an approximation to f. The repre- 
sentation of f, directly in terms of f is obtainable from (2); thus, 


fn(t) = f, gn(t, u)f(u)du (5) 
n+1 
gn(t, u) = Be onlin Ditlgn n= 0. (6) 


Alternative forms which will be found useful are: 





n+1 & u 
f(t) = f, v(n, (+1) =) flu)du (7) 
alt) =(n +1) f "Wn, (n+ Lu) f(tu)du (8) 
O 
Sy 
V(x,a) =e Tet) (9) 
in which y(x, a) is the Poisson probability distribution, 
. t du 
fr(t) = f, M,, (-) flu) = (10) 
M,, (x) = BEI ening ant (11) 


in which the representation is by means of convolution on the half line 
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(Mellin convolution), and 


Enln) =f Knlndg(n ~ dé (12) 
Kn (n) = oe ea (ntl )n-(nt+1je™ (13) 
t= er, u = e& f(t) = a(n), falt) = n(n) (14) 


in which the representation is by means of convolution on the whole real 
line (Fourier convolution). 
The conditions of Theorem 1 are relaxed below. 


Theorem 2. Let the transform, f(s), of f(t) exist for s > c, let f(t) be 
continuous at ¢t and let f(t) = O(e°!)(t — @~); then, 
lim f,(t) = f(t). 


The convergence is uniform in every finite closed interval throughout 
which f(t) is continuous. 


Proof. The representation (5) may be written as follows: 


1 n+1 foe) 
fn(t) _ a f, e—[n—m+))/tluyne—m/tuf(y)du, (15) 
n!} 0 


For all ¢ in some finite closed interval, m may be chosen so that 
e-(m/t)uf(u) = 0(1)(u > ~) 


hence Korovkin’s theorem is again applicable and the conclusions fol- 
low. 


lll. PROPERTIES OF f,(t) 


Jensen’s theorem applied to (5) proves. 
Theorem 38. f(t) is convex on (0, ~) 


= f(t) <frlt),t 20,n= 0. 


The value of an approximation method is greatly enhanced when the 
approximating function preserves the shape of the original and coincides 
closely with its behavior in the neighborhoods of zero and infinity. 
Theorems 4, 5, 6 establish the desired properties. 


Theorem 4.a < f(t) < b= a Sf,(t) S 634, b arbitrary real. 


Proof. Direct evaluation shows that 


Lnf =f, f= at Bt. (16) 
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The positivity of L,, implies 
f<b=>Lyf S$ Lynb = bLy1 = b. (17) 


Similarly for the lower bound. 


The derivatives of f,,(t) may be related to those of f(t) through use of 
(8); thus 


Theorem 5. Let f(t) be continuous and O(e°')(¢ — @), then there is 
an m so that f(t) exists and is continuous for n => m and 


(t) =(n +1) {, W(n, (n+ L)u)urf(tu)du. 


One may set m = Oif f(t) = 0(1)(t — ~). 


Proof. For m sufficiently large, the integral of the theorem converges 
uniformly in t; hence the representation (8) may be differentiated under 
the integral sign r times. If f(t) is bounded then m = 0 is a permissible 
choice since one still has uniform convergence. 

Corollary 1.f9 =0 3 f" =0,n =m. 

Proof. This follows from the positivity of the kernel. 


The above corollary implies that if f has a continuous derivative and 
is monotone then f,, is monotone, and if f has a continuous second de- 
rivative and is convex then /f,, is convex. A stronger structural result will 
be obtained in Theorem 6. One also has that if f is completely or abso- 
lutely monotonic then f, is completely or absolutely monotonic re- 
spectively. 


Corollary 2. f(O+) = Anrf(O+), n = m, 


T(n+r+t 1) 
Meer 
ni(n+ 1)" 


In particular 
fr(O+) = f(O+) n=m 
frx(O+)=f(0+) nem. 
Proof. Define Xy,- by 


har =(nt1) ii Wn, (n + 1)w)urdu 


then evaluation of the integral yields the formula stated. Since the op- 
erator is bounded, the limit statements follow. Also one has \n,o = An,1 
= 1. 
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Corollary 3. Let f(() < ©; then 
fO() = Anrf (2) n20 


In particular 


ful) =f(m) nn 20, 
jn() =f(~)  n 20. 


Proof. Dominated convergence allows the interchange of limit and in- 
tegral. 


The following concepts will be needed to establish further structural 
properties. 

For an arbitrary sequence in (—~, ~), —© < tj, to,...,t¢< ~, the 
number of changes of sign is called the variation of the sequence and will 
be indicated by v(t, ta,..., tg); thus 


v(38, —1, 0, 2, -2) =3 (18) 
v(1, 2, 4, 6) = 0. (19) 


One sets v(0, 0,..., 0) = —1. Let f(t) be defined on (0, ~), and let 0 < t, 
<to<...< tg < © be an arbitrary, ordered sequence in (0, ~), the 
quantity sup v(f(t1), f(t2),..., f(t¢)), in which the supremum is taken 
over all sequences, i.e., for all choices of (¢1, t2,...,¢¢) and for all @ = 1, 
is called the variation of f and will be indicated by v(f). A transformation 
L on a given class of f will be called variation diminishing if and only if 
v(Lf) < v(f) for every f in its domain. The definition used here is adopted 
from Hirschman and Widder.4 
Let $(n) be a frequency function on (—©, ~), that is, 


wn) =0, Ff da)dn=1 (20) 
and let 
Hs) = fend. (21) 
Define E(s) by 
E(s) = $(s)71. (22) 


Then a theorem of Schoenberg‘ states that the transformation 
Tg= { dWetn—DdE — geBC(-#, ©) (23) 
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is variation diminishing if and only if 


E(s) = e~Cs?+bs TT (1 7 ~) es/ak (24) 
k 


arp 
C = 0, b, ay real, © 1/az < ~. 
k 
The designation geBC(—~, ~) means that g(y) is bounded and contin- 
uous on (—©, ~). It may be observed that the mean of ¢ is b and the 
variance 


2C + > 1/az 
Rk 


The Laplace transform of K,,(n) (13), 


Rls) = fo e-Kalnddn (25) 
is 
Kyle) = (26) 


This may be written in the following forms 


“ _Td ts) 2 s 
cae a (n + 1)§ pur (1 : z) 27) 
I —-1 = psyn - = —s/k 
Ry (st = ee TT (1 ra z) e (28) 
je Sa aes (29) 
j=1J 


in which y = 0.5772157 is Euler’s constant. Thus by Schoenberg’s the- 
orem, the transformation T,g = gn, geBC(—@, ~), defined in (12) is 
variation diminishing. The mean and variance of K,, are respectively vy, 
as given above, and o2% given by 


(30) 


Since the map t = e” is monotone, the following theorem has been es- 
tablished. 


Theorem 6. The transformation L,, defined on feBC (0, ~) is variation 
diminishing, i.e., 


v(fn) S v(f). 


Corollary. f, does not cross any straight line more often than f. In par- 
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ticular, if f is monotone then f,, is monotone, and if f or —f is convex then 
fn or —fn is convex. 


Proof. From (16) and Theorem 6, 
v(Ln(f — a — Bt)) = v(Lnf — a — Bt) Sv(f—a— Bt). (81) 


Clearly f is monotone = v(f — a) < 1 for arbitrary a, and f or —f is convex 
= u(f —a— Bt) S 2 for arbitrary a, B. 


It is clear from (8) that the approximating sequence to f(at)(a > 0) 
is f, (at); this may be expressed in a more illuminating way as follows. 
Define the operator A by 


Af(t) = f(at) a>0O (32) 
then A and L, commute; thus 
L,Af = AL, f (33) 


Hence the eigenfunctions of A, which are t’, are also the eigenfunctions 
of Ly. In fact one easily obtains 


Lpt” = Ant’, r 2 0,n = 0. (34) 

It may be observed that if L,, is defined by (8) instead of by (4) then (34) 
remains valid even for r < 0 provided n is large enough. 

Other operations with the same eigenfunctions will also commute with 


L,. Of importance in discussing the convergence of f(t) is the opera- 
tor 


d=t— (35) 
One has 
Theorem 7. The operators L,(n = 0), 6 commute; thus 
Ly (Of) = 8"fn- 


It is assumed that the rth derivative of f exists and is continuous on (0, 


oo), 


Proof. It is observed that the eigenfunctions of 6 are t”; alternatively the 
result follows directly from (8). 


Corollary. Proceeding inductively, one can now establish that if f is 
continuous and O(e“‘)(t — ~) then 


lim f&) =f 


no 


In addition to shape preserving properties, another way of assessing 
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the adequacy of an approximation process is by comparison of moments; 
the rth moment of f(t) is here taken to be JSot’f(t)dt. The Mellin 
transform, f(s), given by 


i(s) = f, ” es-If(t)dt (36) 


is the appropriate tool. Since the transform of M,,(s), eq. (11), is 


_ (n+ 1)sT(n —s4+1) 


M,,(s) “A 


(37) 


one has from (10) 


(n+ 1)sT(n-—s+1)- 


fn(s) = f(s). (38) 
n: 


For the following, it is convenient to use the factorial symbol 


nO=1n™=n(n-1)...(n—rt+1),r>0 (integral). (89) 


The following theorem may now be stated. 


Theorem 8. Let the rth moment of f(t) exist (r = 0, integral); then the 
rth moment of f,(t) exists for n > r and one has 


© “ hie yet © : 
f, Ufa (t)dt = § tf (t)dt. 


n+) 


Special cases are 


f. faae =" fF" poae nea (40) 

0 n 0 

ie tf, (t)dt estes tf(t)dt n>=2. (41) 
0 n(n — 1) Jo 


When fot~1f(t)dt exists, an interesting special case of (38) occurs for 
s = 0; thus 


f, t-lf, (t)dt = J, t-l¢(t)dt. (42) 


Formulae (40), (41) may be used to ensure equality of moments. Thus 
if it is required that the zeroth order moments agree, then, according to 
(40), one may use as the approximating sequence nf, (t)/(n + 1). If the 
zeroth and first moments are to agree simultaneously, one may use a 
linear combination of f, (t) and fm(t); for example, %f3(t) — 2/sfo(t). 

Another set of moment relations may be obtained from (7) involving | 
sums of f,((n + 1)t). These are given in 
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Theorem 9. Let the rth absolute moment of f(t) exist; then 
Y nf, ((n + 1)t) = 6-7-1 f, “ urf(u)du. 
n=0 0 


Proof. One has 


s n(n, a) =a" (43) 
n=0 
and, from (7), 
+” w 
fn((n + 1)t) = #7? f, y (n, *) f(u)du. (44) 


Multiplication of both sides of (44) by n“) and summing, the result 
follows on interchanging summation and integration. Dominated con- 
vergence justifies the interchange. 

Another property of f,,(t) as a function of n is given in: 


Theorem 10. Let f(t) = 0 on (0, ~), O(e°')(t — ~), then there is an m so 
that 


n+2 
t) < at 
fn+i(t) € n¢i 





n+2 
) fa(t),t =0,n =m. 


Proof. Since 











n+ 2\nt2 
(n + 2)Win + 1, (n + 2)u) = ue" ( 7 ) (n + 1)W(n, (n + 1)u) 
n 
(45) 
one may write 
2 + 2v\ n+2 
fn+i(t) = (n + 1) i W(n, (n + 1)u)ue—% & ) f(tu)du. 
0 n+1 
(46) 
Observing that ue~¥ < e—!, the inequality follows. 
Corollary. f(t) 20—= fat) is monotone decreasing in n for allt = 0,n 
zm, 
Proof. One has 
fosilt) < fnft) 9-1 (+ _—n (47) 
n+2 n+l Rie ee 
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The result follows since 


en! (- a = <1, (48) 
n+l 





A stronger monotonicity property of f, (t) is stated in Theorem 21. 

A useful recurrence relation allowing one to compute the members 
of the sequence f,, (¢) successively starting with f(t) is given in the fol- 
lowing theorem. 


Theorem 11. 
n+1 t . sgnt+il 
= t) +—— t),n>0,t20. 
fn+lt) = fn Ges: ) n+!" ear ) “ 


Proof. Define f(s) by 














f(s) =" gut ifion(s) (49) 
n! 
then, by (4), 
fr(t) = f,(s) (50) 
s=(n+1)/t 
One has 
s hubs) = (n+ 1)fn(s) — (n+ 1)fn+1(s) (51) 
fr+r(s) = fas) -—— 8 = fal) (52) 
Thus 


1 d 
n+1 . ds f(s) | con 


The recurrence relation is now obtained on performing the substitution 
for s. 





Oe | in(s) es (53) 


A useful alternative method of presenting the structure of the entire 
sequence {/,,}9 in terms of fp is by means of a generating function. This 
is given in the theorem below. 


Theorem 12. f(t) is bounded on (0, ~) => 








© 1 t 
E 2"ial(n+ Dt) = fo G — -) , 


The series is convergent for |z| < 1 and analytically continuable in the 
half plane Rez < 1. 
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Proof. The formula follows from (7) after interchange of summation and 
integration. The series is clearly convergent for |z| < 1 while dominated 
convergence justifies the interchange for Rez < 1. 


The case r = 0 of Theorem 9 provides the following corollary. 


Corollary. 
WiLOs\ lin = fo ( : ) = et Pfu). 


z-1-1—2z 1-—z 








The Mellin transform, f(s), of f(t) may be directly obtained from its 
Laplace transform, f(s), by use, for example, of (38) for n = 0; thus 


5 fo(s) 
= —_—_e 54 
paar aes (54) 
Accordingly one may now write (38) in the form 


(n+ 1)sT(n+1-s)- 


ie) = rasa 068 (55) 


or, equivalently, 


in(s) = (n + 1) (“ & _) fo(s). (56) 


At times (55) or (56) provides a convenient alternative to Theorems 11 
and 12 when f,(t) is required as a function of n. 

The range of applicability of the Jensen inequality of Theorem 3 may 
be extended by use of Mellin or Fourier convolution. A sequence {f,, (t)}"—0 
will be called an approximation sequence if there is an f(t) so that f, (t) 
= L,f(t). Let « designate Mellin convolution; then 


Theorem 13. f,*g is the approximation sequence for fxg. 
Proof. One has 
Ln (f*8) = Mnf8 = Life (57) 
thus, | 
Ly (f*g) = (Lnf)*8 = fn*8. (58) 
The converse of Theorem 3 is also true. 


Theorem 14. f,(t) = f(t) for alln = 0, t = 0, f(t) is bounded on (0, ©) = 
f(t) is convex on (0, ~). 


Proof. The result follows from Theorem 8 of Karlin and Ziegler.® 


Corollary. f convex on (0, ~), g = 0 on (0, ~) = fxg convex on (0, ~). 
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Proof. One has from Theorem 3 
fn =f (59) 
and, since g = 0, 
fn*8 = fx8. (60) 


Since, by Theorem 13, f, «g is the approximation sequence to f*g, ap- 
plication of Theorem 14 proves the corollary. 


It may be observed that the inequality of (60) remains valid when + 
is interpreted as Fourier convolution although, in this case, f, *g is not 
the approximation sequence for fxg. 


Another set of convexity results may be obtained from (8) by consid- 
ering logarithmic convexity. 


Theorem 15. If f(t) is log-convex on t = 0, then f, (t) is log-convex for 
n2=0,t 20. 


Proof. Equation (8) and the additivity of log-convex functions.® 
Further one may state the following inequalities. 
Theorem 16. If f(t) is log-convex on t = 0 then 
F(t) S ebnénf) < f,,(¢). 


Proof. The inequality on the left follows from Theorem 3 applied to 
énf(t); the one on the right is a consequence of the geometric mean- 
arithmetic mean inequality. 


IV. ERROR ESTIMATION 


Error estimates take different forms depending on the class of func- 
tions for which they are intended and whether or not they are bounds 
or pointwise estimates. From a practical point of view the pointwise es- 
timate is the most useful provided it may be easily evaluated in terms 
of the approximation itself. The next three theorems provide error 
bounds for different function classes; the fourth theorem provides an 
approximate formula for the pointwise evaluation of error, while (112) 
does the same but in terms of f,, (t). The error of approximation, ¢, (¢;f), 
is defined by 


éen(tsf) = fn(t) — F(t) (61) 
Theorem 17. Let f(t) be continuous on (0, ~); then 


len(tsf)| S wae f(e)|. 
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Proof. One has 


én (tif) = f, galt, u)tf(u) — f(é)}du (62) 

len(tsf)| < §,  gn(t, u) flu) — f(t)}du (63) 

len (ésf)| < sup MOL- f° got, u)lu ~ tld (64) 
t>0 0 


7 ro) / 
Jen(tif)| $ sup ot -| Go galt, u)(u — 0)2du 65) 


len(tsf)| < ow [f(e)|. (66) 


The last inequality follows because the mean and variance of g,,(t, u) 
are t and t2/(n + 1) respectively. 


Theorem 18. Let f(t) be continuous on (0, ©); then 





len (tsf)| S SRE \f(t)|. 


Proof. The Taylor expansion of f(u) about t has the form 
f(u) = f(t) + (t — u)f(t) + = x — u)?F(é) (67) 
in which é lies between t and u. Thus 


én(t;f) = 70), £«(0,~). (68) 


The inequality of the theorem now follows. 

The next theorem provides an error bound which is uniform for t¢[0, 
co]. For this purpose the absolute first moment of K,(n) (13), an, is 
needed; thus 


an= f Knl|aldn, (69) 
Theorem 19. Let f(t) be continuous on (0, ©); then 


| en (t3f)| < ap sup |tf(t)]. 
t>0 


Proof. One has from (12) 


enlersf) =f Kula ~ ste) ~ g(n)ide (70) 
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len(e™sf)] < sup lel» f Knln- Dln- ele (71) 
<n< —« 


Simos 
| <n (tsf)| Son sup [tf(t)|. 
t>0 
Corollary. The convergence of f,,(t) to f(t) (n > ~) is uniform for t[0, 
oo], 


Proof. It is necessary to show that 


lim a, = 0. 


n— oo 


The expression for a, (69) is rewritten as follows 


an= pn f KD)" nldn (72) 
1 n+i1ynti 
nN. e 


Use of the power series expansion for e~” yields 


On ™ Pn J. e~(n+D/2)77I nl dn S 2 ( 
— co en! 


Stirling’s formula now shows that 


An © V2. (75) 


Some numerical values of a, are a = 1.0160, a; = 0.6388, ae = 0.5006, 
ag = 0.4247, ag = 0.3751, as = 0.3396, ag = 0.3126, a7 = 0.2911; the as- 
ymptotic formula (75) is sufficiently accurate for n > 7. 

To continue the study of ¢,(t;f), it is useful to obtain an explicit for- 
mula of Peano type, that is an integral transform of f. 

Let 





ye 
e 


(74) 


X+-=x x20 (76) 
=0 x<0 (77) 
then the Taylor expansion of f(t) with remainder is 
f(t) = (0) + HO)E+ fe —v)4flv)dv. (78) 
From (5) and (16), one has 


fn(t) = f(0) + f(O)t + f, * F(v)do f "galt, u)(u —v)du. (79) 
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Thus 
en(t, f) = f, ” E(t, v)f(v)dv (80) 


E,(t, v) = f ” galt, v)(u — v)du — (t — v)+. (81) 


The kernel E,,(t, v) (the Peano kernel for error representation) is, 
clearly, 


E,(t, v) = Ly(t — v)4 — (t — v)+. (82) 


The explicit evaluation of the kernel may be most simply carried out by 
means of (24) since the Laplace transform of (t — v)+ is e~*"/s2. Let 


Sn(x) = = (83) 
j=0 J} 

and 
£n(a) = e~*V2[S, ((n + 1)a) — aSp-1((n + 1)a)] (84) 

then 
L,(t — v)4 = te, (a) a=v/t (85) 
En(t, v) = t[@,(a) — (1 — a)4]. (86) 

In particular, one has 

E(t, v) = te7*/t — (t —v)4 (87) 
Ex(t, v) = (t + v)e—2¥/t — (t — v) 4. (88) 


Since (t — v)4 is a convex function of t for each v, (82) and Theorem 
3 establish 


E,(t, v) = 0 for all t = 0, v = 0. (89) 
The moments of the kernel, E,, (t, v), may be obtained by substituting 


the functions f(t) = t” (r = 2) into (80), and using (34) and (61) for 
evaluation of ¢,(t;t”); the following is obtained: 


S, v'E,(t, v)dv = 
0 


Anrt+ Qn 1 


ee ert 2 
C-4DeLo)° r20. (90) 


In particular 





§, = Cia (91) 
grown Q2nt+1 
oo t3? 8n +5 

En t, d = —_ ©. 2 

Jo eBalt vdde 6 (n+ 1)? a 


One may now obtain an approximate evaluation of ¢, (t;/). 


686 THE BELL SYSTEM TECHNICAL JOURNAL, MARCH 1978 


Theorem 20. Let f(t) be continuous on (0, ~) and 0(e°')(t > ©); then 
there 1s an m so that 


t2 47 8n+5 
tj) * t n2m,; 
en(tsf) maria a) Ura 


also if f(t) is convex, then the approximation is a lower bound. 








Proof. The one point Gaussian quadrature formula for {9 E(t, v){(v)dv 
is of the form Af(a) in which the constants, A, a, are determined by re- 
quiring the quadrature to be exact for all linear functions. Use of (91), 
(92) now yields the formula of the theorem. The inequality follows from 
the nonnegativity of E,,(t, v) (89) and Jensen’s inequality. 


Since by the Corollary to Theorem 7, f,(t) approximates f(t), in 
practice the required value of f(t) is approximated either from the an- 
alytic form of f,,(t) or numerically from a table or curve already com- 
puted for f, (t). 

At this point another property of the sequence {f,(t)}p can be 
proved. 


Theorem 21. Let f(t) = 0, continuous on (0, ~), and O(e°)(t — ~); then 
there is an m so that 


fn+1(t) S fn(t) for allt = 0,n = m. 


Proof. Clearly the monotonic decreasing character of f,, (t) as a function 
of n will hold if ¢,(t;f) has the same property. The nonnegativity of f(t) 
and (80) shows that the result is implied if E,,(t, v) is monotonically 
decreasing in n; in turn, by (86), this will follow if 2, (a) is monotonically 
decreasing in n for each a = 0. From (84), by direct calculation, 


d 
— £,(a) = -e~@tDas) ((n + 1a) (93) 
da 
d? = nie et ee nee 
da2 on - ! aha wets. (94) 
Let 
hn(a) = €n-1(a) — £n(a) n21 (95) 
then, from (94), 
d2 d? 
dag? in) = rn(a) Ja? 21) (96) 
1\ntl 
rni(a)=1- (1 + =) ae~¢, (97) 
n 
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It is clear from (94) that the sign of 


d2 
a2 hy (a) 
is the same as that of r,, (a). There exist two points 0 < ag(n) < a(n) with 
the following properties: 


rn(a) 20 0<a <a)(n) (98) 
rn{a) <0 ag(n) <a <a,(n) (99) 
rn(a) =0 a2=aj,(n). (100) 
Since 
d 
é,(0) = 1,— @,(0) =—-1, n20 (101) 
da 
it follows that 
h,(0) = 0, n, (0) =0, n2l. (102) 
da 
One has the following integral representations for h,,(a): 
a 6 2 
nla) = [db (rule) 5 Ln-ile)de, (103) 
0 0 dc? 
pe ab ele pela 
n(a)= f dbf rnle) 5 Ln-aledde, (104) 
Thus (98) and (103) imply 
h,(a) = 0 0<a <ap(n); (105) 
similarly (100) and (104) imply 
h,(a) 20 a =aj,(n). (106) 


The function h,(a) cannot be negative in (ao(n), a;(n)) since then it 
would have at least one local minimum; however, (99) shows that in 
(ao(n), ay(n)) 
2 
da? h,(a) <0 
which is a contradiction. Thus 

h,(a)20,a20,n 21 (107) 
and the theorem is proved. 


It is possible to estimate conveniently ¢,,(t;f) directly from f,, (t) if f, (t) 
and /,,(t) are readily obtainable, at least possibly numerically from values 
of f,, (t). From (38) one has 


688 THE BELL SYSTEM TECHNICAL JOURNAL, MARCH 1978 


f(s) = Mn(s)“fn(s). (108) 


Expansion of I'(n + 1 — s) into a power series in s and substitution into 
(37) provides the following series 


Thus 








— rae os 2 
Mn(s) = 1+ vps +=2 Bs? +... (109) 
a P30 
M,,(s)71 = tape" ; ae ae (110) 
oe _ 
€n(s;f) = E x oped ae et 9 +. 7 hi (111) 
) aemeey 3 
én(tsf) © —vn Of n(t) Hae Oke), (112) 


To facilitate the use of (112) some values of the coefficients are given 


in Table I. 


The following readily obtained asymptotic formulas may be used for 
values of n beyond the table: 





1 1 
n ®t — 113 
me On +1) 12(n + 1)2 ee) 
1 1 
2 
ries + 
3 n+1 2(n+1)? 
i oe 1 al 





~ a 
2 2n+1) 8(n +1)? 


V. ENHANCEMENT OF ACCURACY 


The excellent behavior of the operator L, in constructing approxi- 
mations to a given f(t) which preserve its structural properties and its 
limiting values and which provide inequalities exacts a penalty in the 
form of slow convergence. A high value of n is required to attain high 


CADPR WNHKH OS [FS 


Table | — Coefficients 
Vn of (o2 — v2)/2 
0.5772 1.6449 0.6559 
0.2704 0.6449 0.2859 
0.1758 0.3949 0.1820 
0.1302 0.2838 0.1334 
0.1033 0.2213 0.1053 
0.0856 0.1813 0.0870 
0.0731 0.1535 0.0741 
0.0638 0.1331 0.0645 
0.0566 0.1175 0.0572 
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numerical accuracy. In many practical problems, fortunately, very high 
accuracy is not needed; notwithstanding, the value of n required may 
still be inconveniently high. Considering that one starts with the Laplace 
transform, f(s), of f(t) and uses (4), or constructs fo(t) and uses the re- 
cursion of Theorem 11, a high value of n implies obtaining a corre- 
spondingly high order of derivative of f(s) or of fo(t) which can be a 
time-consuming operation. Thus it would be useful to modify the basic 
approximation, L,f, while still preserving many of its original charac- 
teristics so that the accuracy for a given value of n may be increased. 

In many cases the transform, f(s), has the property that for some a 
> 0, f(s — a) converges for s > 0. This property is used to construct a new 
approximation, fn. (t), defined by 


Fnolt) = e~“'L, (e*f(t)) (114) 
and, correspondingly, a new operator 
Ln,al = fra (115) 


The following theorem permits the construction of f,,.(t) directly from 


fn(t). 








Theorem 22. 
ewat t 
fr,a(t) = ae | 
(1- at a 1- at 
n+1 n+1 
Proof. From (5) and (6), one has 
(n+1)"t+1 pe 
Lf = ee f, ent D/tluy nf (y)du (116) 
ee A ee n+1)/tlutau,,n 
Ene) = fi erintvielebeuunfu)du. (117) 
Thus | 
t 
Unf | iat/n+) 
at \ntl(n+1)7*1 co 
= Page Se —[(n+1)/tlu+tau,,n 
(a as - nine f, el tlutauynf(y)du. (118) 
Comparison of (118) with (117) shows that 
1 t 
Le!) (119) 








(1 7 at i= at 
ned n+l 
hence, the result follows from (114). 


690 THE BELL SYSTEM TECHNICAL JOURNAL, MARCH 1978 


The approximations f,, .(t) satisfy theorems similar to those proved 
for f,(t); however, modifications are required. Only Theorem 3 will be 
discussed. A function f(t) will be said to be convex over an interval with 
exponent a if and only if e“‘f(t) is convex over the same interval. 


Theorem 23. If f(t) is convex on (0, ~) with exponent a then 
f(t) S fra lt). 
Proof. One has from Theorem 3, 


evf(t) < Ly(ef(t)). (120) 


The result now follows from the nonvanishing of e“! and (114). 
The error of approximation by f,,..(t) will be designated €p,.(t;f) and 
defined by 


én,altsf) = fralt) — F(t). (121) 
Clearly 
én,altsf) = e~*en(tse“'f). (122) 
also, if the condition of Theorem 21 is satisfied, one has 
En,a(t;f) = 0. (123) 


One of the useful aspects of the approximation, fn,.(t), is that it more 
accurately reflects the asymptotic behavior of f(t)(t — ©) than f,, (t) does 
for a given value of n. In the later applications this will be an important 
characteristic. 

Clearly, ordinary convexity corresponds to convexity with exponent 
zero; however, the following theorem relates convexity with exponent 
a to log-convexity. 


Theorem 24. Let f(t) be continuous on some interval I; then f(t) is log- 
convex on J if and only if it is convex with exponent a on J for all a. 


Proof. One has 
én (e“f(t)) = at + énf(t) (124) 


hence e“‘f(t) is convex with exponent a on J for all a if f(t) is log-convex 
on J. The derivative condition for convexity with exponent a on J is 


f(t) + 2af(t) + a2f(t) = 0 onl (125) 
and the derivative condition for log-convexity on I is 
f(t)f(t) — f(t)2 = 0 on I. (126) 


The choice 
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a = —f(t)/f(t) (127) 
which is always possible since f(t) > 0 on J, in (125) verifies (126). 


Convexity with exponent a and, hence, by Theorem 24, log-convexity 
may be decided by means of the Laplace transform and the use of the 
Hausdorff-Bernstein theorem.’ 


Theorem 25. Let f(t) be continuous on (0, ©), then f(t) is convex with 
exponent a on (0, ~) if and only if 


(s + a)2f(s) — (s + 2a)f(0+) — (0+) 


is completely monotonic in s on (0, ~) and is absolutely convergent on 
s>0. 


Proof. The expression cited is the Laplace transform of 
suet? 
ert 7) (e'f(t)) 


whose nonnegativity is the necessary and sufficient condition for con- 
vexity of f(t) with exponent a. The Hausdorff-Bernstein theorem now 
completes the proof. 


It may be observed that the quantities {(0+), f(0+) are obtainable 
from 


lim sf(s) = f(0+) (128) 
lim {s?f(s) — sf(0+)} = f(0+). (129) 


Another method of enhancement is related to the concept of “degree 
of precision.” An approximation operator T, i.e., Tf ~ f, in which the 
functions t’, suitably restricted to an appropriate interval (r = 0, inte- 
gral), are in its domain, is said to have degree of precision k if Tt” = t" 
forO <r <k and Tt’ # t" forr=k + 1. Thus the singular operators L, 
studied here have degree of precision one. . 

The enhancement method consists of the following: coefficients 6; (0 
<j <k — 1) are determined by the moment conditions 


k-1 
> 6L;(t") = t" O<r<k (130) 
j=0 
and, accordingly, the linear combination 
k-1 
sealt) = 5 fil (131) 
l= 


is now taken to approximate f(t). Clearly the map from f to sz-1 has 
degree of precision k, however, unfortunately, it is not positive. If f is 
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sufficiently smooth there will result a significant improvement in ac- 
curacy over the use of f,-; alone. The system (130) may be expressed 
in terms of \,,, as follows 


k-1 
yo Aj =l  1srKk. (132) 
jJ=0 


Accordingly special cases of (131) are 
81 = —fo + 2f; (133) 


s2= 5 fo- Afi + 5 fa (134) 


A method of enhancement consisting of a linear combination of the 
fn(t) similar to (131) which, however, retains the positivity of the map 
will now be constructed. The accuracy attained will usually not be as 
great as that of (131) for a given set of values {f;(t)}§. The new sequence 
will be designated h,,(t) and is defined by 


hp (t) = 2 pifi(t). (135) 
Define W,,(u) by 
Wale) = ¥ ii + DG, G+ Yu) (136) 
then the coefficients, p;, are constrained by 
= pj = 1, Wa(u) = 0 for all u > 0. (137) 
= 


Theorem 26. 


1/2 
_ Pi 
|hn(t) — f(t)| S t sup f()| pe + i} 


Proof. From (8), one has 


hy(t) = f, ” Wr, (w)f(tu)du. (138) 
Also, from (137), 
| f, ” Wy(u)du = 1 (139) 
hence 
hn(t) — ft) = f° Wa w)[f(eu) ~ f(@)]du. (140) 
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The nonnegativity of W,,(u) now permits the following inequality 
lin(t)— FOL S f° Walw)lfltw) — fO)|dus (14) 


hence, 
Jrn(t) ~ f(6)| St sup [f(0)| f W,(u)|u —1|du. (142) 
The Cauchy-Schwartz inequality applied to (142) yields 
Inn(e) — f(0)| <tsup FOL { ” We(uyu = 1)8du) (4a) 


Evaluation of the integral in (143) provides the inequality of the theo- 
rem. 


Consider the sum, S, defined by 
S=y Ae (144) 
j=0J) +1 


then, in order to obtain the best approximation, the p; must be chosen 
to minimize S besides satisfying the conditions of (137). One has 


j+1 . : 
eH Wu) = 3 AO pye-Ha (145) 
=0 J: 
Let 
z=el-vy (146) 
then the constraint of (137) may be written 
n +1)tl ; : 
SE ving =0 O<z<l. (147) 


Define the polynomials P(x) by (147) with z = (x + 1)/2; then one 
has 


n k+1 
Payee xi[ SEE (eto | 4s) 
J=0 k=j k} J 
and 
P(x) 20 -l<x <1. (149) 


The cosine polynomial P(cos 8) is now obtained and written in the 
Fourier form 


P(cos @) = ao + z a; cos j0 (150) 
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in which the a; are obtained from 


oe ff. ” P(cos 8) cos jod0. (151) 
T —W 
The nonnegativity of P(cos 6) implies the following representation® 
P(cos @) = |h(0)|? (152) 
h(0) =xo + xye®@+...+x,e%”? (153) 


in which the coefficients xo, ..., X, are real; thus 
n-J : 
ap= 2 >> x Maj O<j <n. (154) 
v=0 


When (154) is solved for the p; in terms of xo, ..., Xn, the problem of 
minimizing S subject to pp +... + py, = 1 becomes that of minimizing 
a quadratic form relative to another quadratic form. 

The optimum p; have been obtained for the case n = 2; the result is 


ho(t) = 0.146993f9(t) — 0.944260f,(t) + 1.797267fo(t) (155) 
with S = 0.273952. Thus, from Theorem 26, 


‘|ho(t) — F(t)| < 0.523404¢ sup |f(z)|. (156) 
t>0 


The estimate of ¢,,(¢;f) in (112) may be effectively used to reduce error. 
One may take as an approximation to f(t) the following 


o _ y2 
f(t) © fn(t) + rn Ofn(t) ey OP Fatt). (157) 


In order to improve fn,.(t), the approximate calculation of €n,4(t3f) 
proceeds by use of (122). The practical use of (112), (157) uses difference 
quotients to evaluate Of, (t), 62f,(t) from the values already obtained 
for f,, (t). Thus, let h > 0 be the distance between consecutive values of 
t for which f,,(t) is calculated; then 


, fat +h) = fn(t —h) 


Ofn(t) ~ oh 


(158) 
fri(t a3 h) ea 2fn(t) + frlé ~~ h) 


O?fn(t) ~ Ofn(t) + t? 2 


(159) 


The following comment should prove useful in reduction of error. If 
a function g(t) is known which approximates f(t), for example, the 
leading term of an asymptotic expansion for f(t), then one may use 


f(t) ~ g(t) + Lalf —g). | (160) 
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Evidently an appropriate g(t) should always be sought before con- 
structing practical approximations to f(t). 


Vi. THE RENEWAL FUNCTION 

In this section some of the preceding theory will be applied to ob- 
taining approximations for the renewal function, M(t), of a renewal 
stream.? Let A(t), with A(0+) = 0, be the interarrival time distribution 
and A(s), given by 


A(s) = f, e~stdA(t) (161) 
its Laplace-Stieltjes transform, then 
: 1 A(s) 
= 162 
M(s) 1206) (162) 


The sequence of approximations, M,,(t), may now be constructed 
from 


1 


M(t) = 1-da/p — 1. (163) 
In particular one has 
—_ 1, 2 A’(2/t) 
Mil 1— A(2/t) : t [1 — A(2/t)]? en 
in which 
A’(s) = © Re: (165) 
ds 


Let A be the arrival rate, and o? the variance of interarrival time, that 
is, 


-1 = 7 
nN §, tdA(t) (166) 
o f, t2dA(t) —X (167) 


then evaluation of the contribution of M(s) at s = 0 provides the 
term 


o7\2 — 1 


At + 168 
Z ee . 
Thus one may introduce a new function, f(t), by 
2\2-—] 
M(t) = pr f(t) (169) 
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with 

A(1/t) o2\2 — 1 
p=. none 

1 — A(1/t) 2 

Since linear functions are invariants of the operators L,,, there is no re- 

duction of error when approximating f(t) by f, (t) over approximating 

M(t) by M,, (t); however, often f(t) is exponentially dominated and the 

enhancement technique of Theorem 22 is applicable. 

The following example will be considered: 


(170) 


ee 1 
A(e) = ext V/* WO eae (171) 
~ 1 1 
Ms) = -—————. 
(s) $V1+2s-1 nee) 
Thus, | 
1 
MN) = asa (173) 
M(t) ol ee ee ee (174) 
I’ VT4F4E- 1) t V1 4 4/1 + 4 - 1? 
Since \ = 1, a? = 2, one has 
M(t) = 0+ 5+ fe) 
ea 2 Sees eee eS 
WO VT +4 N-1 tt VEE 4/1 + 4 - 2? 2 
(175) 


The a transformation of Theorem 22 may be applied to f;(t). As- 
suming /;(t) to be ultimately of one sign, the singularity farthest to the 
right of f(s), namely —, coincides with the abscissa of convergence; 
hence, a = 4. Table II compares the approximations for M(t) given by 
M(t), the enhancement procedure of (133), and t + % + fi1/o(t) with 
more accurate values obtained from the exact solution 


co 1 t/2 
M(t) = eran f, e-Myn-1/2¢_yy. (176) 
T (n + 5) 


Since M(t) ~ t + 44(t — ©), the accuracy increases with increasing 
t. This is characteristic of the applications to the renewal function. 

An example will now be considered in which the interarrival time 
distribution is a mixture of exponentials since this is of frequent practical 
use. Accordingly let 
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Table Il — Comparison of approximations 








t Mi(t) s(t) t+ 5+ hilt) Mit) 
0 0 0 0 0 
0.5 0.83333 0.85764 0.84297 0.86007 
1 1.39443 1.42283 1.41014 1.42466 
2 2.44338 2.47255 2.46303 2.47161 
5 5.48142 5.50480 5.49568 5.49718 
10 10.49350 10.50977 10.49980 10.49989 
a" 3 
A(t) = 1—-—e-t — — e-2t, 
(t) 10° a (177) 
Then, 
: 7 1 6 1 
s)=— a ee 
(s) 10s+1 10s4+2 ngs) 
1 20+ 18s | 
M(s) = —-———_-- 179 
(8) = a 17 + 10s ay) 
Also one has 
20. 21 
M(t) =—t+——+f(t 
(t) 17° + 599 f(t) (180) 
210 1 
t) = ———_——__.. 181 
fol) = ~ 389 17 + 10 a8l) 
Application of Theorem 12 provides 
8400 1 
t) = —-—— ———_—__. 182 
MW) =~ "989 (Tt + 202 a 
5.67 X 105 1 
t) = ——————_ —___.. 183 
oe 289  (17t + 30)3 — 
The exact solution for this simple example is 
20. 21 
M(t) =—t +— (1 - ex) (184) 
17-289 


Table III compares calculations from M(t) and the enhancement pro- 
cedures of (134) and (155) with the exact value. The a enhancement 
procedure with a = 1.7 was not used because it produces the exact re- 
sult. 

This exampleshows the operation of the enhancement procedures (134) 
and (155); clearly, s2(t) is very accurate since the constraint of positivity 
of the approximation operators is discarded in its construction. 
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Table Ill — Comparison of calculations 


t M2(t) s(t) holt) M(t) 
0 0 0 0 0 
0.5 0.62652 0.62967 0.62712 0.62984 
1 1.23024 1.23559 1.23127 1.23586 
2 2.41812 2.42353 2.41913 2.42318 
5 5.95373 5.95595 5.95407 5.95500 
10 11.83713 11.83747 11.83710 11.83737 


Vil. THE COVARIANCE AND RECOVERY FUNCTIONS 


The study of errors in switch count and continuous scan observational 
methods in telephone traffic engineering is facilitated by use of the co- 
variance function of the number of busy trunks in the Erlang blocking 
system.!! Specifically let x; be the number of trunks busy at time t in 
an equilibrium M/M/C blocking system with unit mean holding time 
and offered load of a erlangs, then the covariance function, R(t), is 


R(t) = E(xoxe) — (Ex0)?. (185) 


In order to express the Laplace transform, R(s), it is necessary to in- 
troduce the Poisson-Charlier polynomials!®!8 which may be obtained 
from 


Gi(x,a) = & (-Divr (? ) yla-¥ (*) (186) 
v=0 v v 
They satisfy the following recurrence 
Gj41(x, a) = a Ga) ~2G)-(z, a) (187) 
a 


Go(x,a)=1 Gi(x,a)= zs =e 


Also needed is the function a;(x, a) given by 


Gj-1(x, a) 
. ey Se 188 
aj (x, a) GiGi) (188) 
which satisfies the first order recurrence 
oji(x, a)-} = 2 —F aj (x, a) (189) 
a a 


ay(x,a) = G _ 1). 


The zeros of G(x, a) are all positive and simple; in particular, the zeros 
of G;(—s — 1, a) as a function of s are designated r; and ordered by 
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es bs STS 0: (190) 


In the approximation to be developed, r will be the dominant root. 
The Erlang loss function,!° B(c, a), given by 


cy ec qi 
Baas ea 
gives the probability that all servers are busy. In the formulae below it 


will be designated simply by B. The mean number of servers busy, nu, 
is 


pw =a(1—B) (191) 
and the variance, o?, of the number of busy servers is 
o27=p—al(c—w)B. (192) 


The Laplace transform, R(s), and the covariance, R(t), are! 





Ris) = + au acB 
its s(1lts) (1+:s)? 

pe? acB 
— + ——. a, (-s - 1, 193 
s sats! ay a2) 

c a“B 

RQt)= > Aet Aps=-——— ( ) (194) 
% 4 rjQ + rj)? ixj rp—ri 


The approximation Ro(t) is 
oF +p? ant 
i a oe 2 Lae 


2 





Ro(t) = = 








- acBt acBt? 1 
Eee aa pete (195a) 


t 
Since the dominant root is r; one may choose a satisfying 0 <a <—r, 
to obtain 


Roa(t) = o7e~*g(t) 





1+ p?/o? apt/o? 
sO) TF a-at Gatl+ dah 
p2/o2 acBt/«a2 
~1l-at (+(-a)t)? 
acBt2/o2 


i 
(1—at)(1+ (1—a)t)? oe G - ta, a) - (195b) 


It is known that the zeros r; are separated by at least one so that 1 — 1/(r; 
— r;) > 0, and hence A; is positive for each j; thus R(t) is log-convex. 
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Accordingly, the following inequality is valid (Theorem 23): 
R(t) < Ro, a(t). (196) 


In order to facilitate the use of Ro,.(t) an accurate upper bound for 
r; is needed to provide a suitable choice for a. Such a bound is available 
in Ref. 12. Thus let 


c 1 
= > -ce%a-’ (197) 
p=l1vV 

ec 1 v-1] 
m= 2-2 > -ea-’ + = (198) 

v=2V j=1J 

then 
c 

< = (199) 

eo DE ene o> 


To illustrate the practical performance of (195b) and (199), calcula- 
tions were made for the cases a = 4, 8, 12 and c = 8 corresponding to 
medium, heavy, and very heavy loads respectively. The corresponding 
equilibrium blocking probabilities are B(8, 4) = 0.030420, B(8, 8) = 
0.235570, B(8, 12) = 0.422655. Table IV compares the exact and ap- 
proximate values. Figures 1(a), 1(b), and 1(c) compare the corresponding 
curves. 


Table IV — Comparison of exact and approximate values 


a=4 a=8 =12 

t R(t) Ro,a(t) R(t) Ro, a(t) T(t) Ro,a(t) 
0 3.377 3.377 2.564 2.564 1.492 1.492 
0.4 2.143 2.145 1.075 1.091 0.312 0.331 
0.8 1.365 1.367 0.474 0.483 0.075 0.079 
1.2 0.870 0.872 0.212 0.216 0.019 0.020 
1.6 0.555 0.556 0.095 0.097 0.005 0.005 
2.0 0.354 0.355 0.043 0.043 0.001 0.001 
2.4 0.226 0.227 0.019 0.019 
2.8 0.144 0.145 0.009 0.009 
3.2 0.092 0.092 0.004 0.004 
3.6 0.059 0.059 0.002 0.002 
4.0 0.038 0.038 0.001 0.001 


The quality of approximation of (199) may be seen from the following 
values of a used in (195b) compared to the exact r, values. 


a TT: Qa 
4 1.1218 1.1215 
8 2.0000 1.9730 
12 3.4778 3.3415 


The transition probabilities P;;(t)—the probability j trunks are busy 
at time ¢ given trunks are busy at time zero—may all be obtained from 
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the transition probability P,,(¢)4; this probability as a function of time 
is called the recovery function. It may be used in a similar manner to the 
covariance function, R(t), for the study of errors in scan measurement 
techniques!4; additionally it is especially important in the analysis of 
telephone retrial models. 

The Laplace transform, P..(s), and the recovery function, P.,(t), 
are}! 





P..(s) = -+ eae es 1, a) (200) 
as 
P..(t) = B— ¥ Bjerit (201) 
j=l 
B;=— 1- . (202) 
: HL ( oe) 


As for the covariance function, B = B(c, a), andr;(1 <j <c) are the roots 
of G.(-—s — 1, a) as a function of s. 
In order to apply the a enhancement procedure, the function 


f(t) = Pee(t) — B (203) 


is considered whose Laplace transform is 
jisy=*[1-B +S ac(-s- 1,0) |; (204) 
S a 


It may be observed from (201) and (203) that f(t) is log-convex, hence 
the approximations obtained will constitute upper bounds. In order to 
demonstrate the operation of the approximations, the functions fo,.(¢), 
fije(t), and 


s(t) = 2fta(t) — fo,a(t) (205) 


were constructed; they are 


fo,a(t) = a |[1-8+t es (-F+a-12)|, (206) 


at 9 
pci [1-B+£a.(-F+a-12)| 
1 2 a t 


—at 
a (-=+ a 1a): (207) 
at at t 


2 
The prime on a, (x, a) indicates differentiation with respect to x. The 





+ 
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COVARIANCE FUNCTION — R (t) 


O = EXACT SOLUTION 
O = ZEROTH ORDER APPROX 
WITH ALPHA 


NUMBER OF TRUNKS c= 8 
OFFERED LOAD a= 4 ERLANGS 


(a) 


O = EXACT SOLUTION 

© =ZEROTH ORDER APPROX 
WITH ALPHA 
NUMBER OF TRUNKS c= 8 
OFFERED LOAD a= 8 ERLANGS 


(b) 





Fig. 1—Covariance function. 


O = EXACT SOLUTION 
O = ZEROTH ORDER APPROX 
WITH ALPHA 


NUMBER OF TRUNKS c=8 
OFFERED LOAD a= 12 
ERLANGS 


(c) 





following recurrence relation is obtained from (189); 


Since s;(t) does not correspond to a positive operator, it does not provide 


, 1 Peet a rae 
aj+1(x, a) = a Lia; (x, a) — 1]ajei(x, a)? 


a bound for P,,(t); one has, however, 


ay(x,a) = — 7 a(x, a)?. 


Pec(t) S$ B+ fia(t) S B+ fo,a(t). 


The first inequality follows from Theorem 23 and the second inequality 
from Theorem 21. 

The same cases as for the covariance function were treated. Tables 
V, VI, and VII compare the exact and approximate values. Figures 2(a), 


2(b), and 2(c) compare the corresponding curves. 





Table V— a=4 


(208) 


(209) 


t Pee (t) B + fo,a(t) B + fiat) sx(t) 
0 1.0000 1.0000 1.0000 1.0000 
0.1 0.5178 0.5907 0.5597 0.5287 
0.2 0.3304 0.4280 0.3856 0.3432 
0.3 0.2380 0.3335 0.2901 0.2468 
0.4 0.1844 0.2703 0.2296 0.1889 
0.5 0.1497 0.2249 0.1880 0.1511 
0.6 0.1256 0.1907 0.1578 0.1248 
0.7 0.1080 0.1641 0.1350 0.1059 
0.8 0.0947 0.1429 0.1174 0.0918 
0.9 0.0844 0.1258 0.1034 0.0810 
1.0 0.0762 0.1118 0.0922 0.0726 

Table Vi— a= 8 

t Pec (t) B + fo,a(t) B+ fralt) si(t) 
0 1.0000 1.0000 1.0000 1.0000 
0.1 0.5756 0.6335 0.6088 0.5842 
0.2 0.4379 0.5005 0.4725 0.4445 
0.3 0.3727 0.4256 0.4006 0.3756 
0.4 0.3347 0.3770 0.3561 0.3352 
0.5 0.3099 0.3432 0.3262 0.3092 
0.6 0.2927 0.3187 0.3050 0.2913 
0.7 0.2802 0.3005 0.2895 0.2786 
0.8 0.2708 0.2866 0.2779 0.2692 
0.9 0.2637 0.2760 0.2690 0.2621 
1.0 0.2581 0.2677 0.2622 0.2567 


This example will be used to show the operation of the error estimate 
(112). Using the increment h = 0.1, (158) and (159) were used to obtain 


ble*“fo a(t), 07[e“fo .(t)] and 6le“fi a (t)I, 07[e“f, .(t)] at t = 0.5. 


Equation (122) was used to estimate €9,.(t), €1,a(t). The error in s;(¢) was 
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RECOVERY FUNCTION — Pcc (t) 


0 = EXACT SOLUTION 


O = ZEROTH ORDER APPROX 
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© = ZEROTH ORDER APPROX. 

4 = FIRST ORDER APPROX. 

+= R,(7) 
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(b) 
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Fig. 2—Recovery function. 
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Table VIl— a= 12 
t P.(t) B + foa(t) B+ frat) si(t) 


0 1.0000 1.0000 1.0000 1.0000 
0.1 0.6245 0.6679 0.6493 0.6307 
0.2 0.5255 0.5629 0.5458 0.5286 
0.3 0.4834 0.5097 0.4969 0.4840 
0.4 0.4611 0.4789 0.4698 0.4607 
0.5 0.4479 0.4598 0.4535 0.4472 
0.6 0.4396 0.4476 0.4427 0.4378 
0.7 0.4342 0.4396 0.4366 0.4336 
0.8 0.4306 0.4342 0.4322 0.4301 
0.9 0.4282 0.4306 0.4292 0.4278 
1.0 0.4265 0.4281 0.4272 0.4262 


estimated by 2¢1,4(t) — €0,.(t) in which the estimates for ¢0,a(t), €1,0(t) 
were used. The results obtained are given in Table VIII. 


Table VIIl — Error estimates at t = 0.5 
a 0,0 Estimate E10 Estimate s,—f Estimate 
4 0.0752 0.0714 0.0383 0.0372 0.0014 0.0030 
8 0.0333 0.0317 0.0163 0.0160 —0.0008 0.0002 
12 0.0120 0.0113 0.0056 0.0001 —0.0007 —0.0111 


Vill. SOME APPLICATIONS OF THEOREM 12 


The generating function, which will be designated G(z, t), of Theorem 
12, namely, 


G(z,t)= 








ho =) (210) 


may sometimes be used to obtain explicitly the form of f, (t). The fol- 
lowing are some examples. 
For f(t) = cos t, one has fo(t) = 1/(1 + t?), hence 
1-z 
(1-—z)?+t? 
The generating function for the Chebyshev polynomials, T,,(t), of first 
kind is!® 


G(z,t) = (211) 


1— tz S 
1-2tz+22 <0 Tr (the (212) 
hence 
isd 1 
G(z,t)= © 2714+ t2)-@+vP7,, (a) . 213 
ete Vite a 


One now obtains 
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t lia ee 


fu(t) = Ln cost =[1+(—25) | 





vie (5) 


(214) 
For f(t) = sin t, one has f(t) = t/(1 + t?), hence 
t 
G(z, t) = —————.: 
(z, t) (24 P (215) 
The generating function for the Chebyshev polynomials, U,,(t), of second 
kind is 








1 foe) 
a aea a9” E Uale 216) 
hence 
© 1 
G(z, t) = mL ear ET )- 
2 ( ) Via (217a) 
Thus 
27-(n+1)/2 
fa(t) = Ly sint -— [1+ ( l ) | 
n+1 n+l 
(217b) 





1 
Un SSS . 
t 
4/14 ( ) 
n+1 
The Bessel functions provide additional interesting relations with 


orthogonal polynomials. For f(t) = Jo(t), one has fo(t) = 1/V1 + ¢2, 
and 


1 
C2.) =a (218) 
ie) V(1l—z)?2+t? 
The Legendre polynomials, P,,(t), are generated by 
; = P,(t)z" ag) 


V1 —2tz+22 =o 


hence 


folt) = Indolt) =| 1+ (5 


Reeves 
n+ i) | 
Wa 





i) 


(220) 
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By the substitution of it for t, one derives immediately 


fr(t) = Lplo(t) = E : ( t ‘leak 





n+l 


(221) 





1 
Pe SoS . 
Via) 
ae wae 
Since Jo(t) is convex, one also has 
I(t) S Lnlolt) (222) 
for sufficiently large n. 


As another example relating to Bessel functions, consider f(t) = 
Jo(2V't), then fo(t) = e-* and 


G(z,t)= 





fig (223) 


The generating function for the Laguerre polynomials, L,,(¢), is 





e—t2/(-2) = X Ln(t)z” (224) 
baw n=0 
hence 
t 
= = p—t/(nt1) as es 995 
fn(t) = Lndo(2Vt) = e Ln ean ; (225) 
iX. SUMMARY 


The methods of this paper have been found particularly useful in 
analyzing complex queueing phenomena whose Laplace transform 
representations are quite often implicitly defined. The error estimate 
of (112) has been found especially useful. Its computation is numerically 
effected by use of (158) and (159). 

It would be desirable to have an effective method of estimating the 
a parameter of (114) directly from f,,(t). In fact a method of this type 
which yields a rough evaluation has been devised and will be reported 
in a later paper. Of interest also would be further elaboration of the way 
structural properties of f(t) are reflected in fp,_(t). 

The investigation of linear combinations of iterates, L7,, of the oper- 
ators L, may prove useful in providing additional enhancement methods. 
Especially, further investigation is needed concerning enhancement 
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methods which preserve the positivity of the approximation process. 

The isolated result of Theorem 16, which shows that exp(L, @,/(t)) 
is a better approximation to f(t) than f,(t) when f(t) is log-convex, 
should be examined with the purpose of the possible construction of 
nonlinear approximation methods exploiting this structural charac- 
teristic. 


X. ACKNOWLEDGMENTS 


I should like to acknowledge the helpful discussions with B. W. Stuck 
and E. Arthurs during their application of these methods to the analysis 
of flow distributions for computer models, and to acknowledge gratefully 
the excellent computer programming work of Rosemary Harris and 
Darlene Shearer. 




















APPENDIX 
Operations 
f(t) fo(t) or fn(t) 
1 1 t dx 
fF) * f, foe) = 
f(at),a>0 fn(at) 
at \—n-l t 
ne ere) fn eee ee =) 
m7 itt 
So pele n+ 1h (¢ 1 i) 
fo(t)-f(0) n+1 7 n 
j(t) oe a | Ce AE ae ‘) | (n> 1) 
#(t) fo(t) - eo — tf(0) y fi(t) — ee + f(0) 
-1 
fn(t) — 2fn—1 (——=t) + fn—2 (——t 
(n+ 1)? Worthen Gar) thelr), > 2) 
tf (t) tfo(t) + t?fo(t) 
tf (t) tfn(t) 
1 t 1 t 
rat f(x)dx j f, fn (x)dx 
f(t)*h(t)(Mellin) | f,(t)*h(é) 
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Observations of Errors and Error Rates on T1 
Digital Repeatered Lines 
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Measurements of errors on T1 repeatered lines were made at five 
Bell System offices during 1973 and 1974. They included an informal 
survey of error rates and error-free seconds on lines in service, and 
detailed recordings, normally of 24 hours duration on selected lines. 
The detailed recordings show the existence of at least two distinct error 
mechanisms, differing significantly tn diurnal variation of error rate, 
distribution of intervals between errors, and dependence on the 
transmitted bit pattern. It was found that certain T1 lines made errors 
when in service, driven by D1 channel banks, but not when driven by 
a pseudorandom test signal. 


I. INTRODUCTION 


The T1 repeatered line is a short-haul digital transmission system 
using cable pairs to transmit binary information at a rate of 1.544 X 106 
bits per second.! T1 lines have been in use since 1962, primarily in con- 
junction with D1 channel banks to provide a carrier system for voice 
channels. In the design of the T1 line, the distribution of error rates was 
an essential design parameter.” However, since the field trial of an ex- 
perimental prototype system, in which the error performance of one 
worst-case repeatered line was briefly reported,* it has become clear that 
more knowledge is needed on the subject of the error performance of 
working T'1 lines. 

Measurements of timing jitter and errors on T1 lines were made at five 
Bell System offices during 1973-74. The error measurements were of two 
types: a survey of error rates and error-free seconds; and detailed re- 
cordings of the error process, normally for 24 hours duration, on selected 
T1 lines. The jitter measurements will be reported elsewhere.* 

The survey consisted of error rate measurements (determined by 


* See pp. 95-96, Ref. 3. 
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counting bipolar violations on lines in service) on 2594 T1 lines, and 
error-free seconds measurements on 1640 of these lines. The primary 
purpose of the survey was to select particular lines, known to be making 
errors, for detailed recordings of the error process. Such recordings were 
made on 89 lines. However, many of these recordings showed few or no 
errors. Recordings of 37 of these lines were analyzed to characterize the 
following properties: diurnal variation of error rate, distribution of the 
intervals between successive errors, and sensitivity to the pattern of bits 
transmitted on the T1 line. Not all these properties could be analyzed 
for every line; there are 22 lines for which all three properties have been 
characterized. 

Most of the lines in the survey were terminated with D1 channel banks; 
some had D2 banks. All of the detailed recordings were made on lines 
with D1 banks. Since the pulse stream format generated by a D1 channel 
bank is different from that generated by other equipment (for example, 
in regard to density of ones), some specific results of these measurements 
are valid only for the D1 environment. 


ll. ERRORS AND THEIR MEASUREMENT 


The digital format on the T1 line is bipolar: that is, the absence of a 
pulse in any position represents a binary “zero,” while a pulse of either 
polarity represents a binary “one,” and consecutive pulses, regardless 
of the number of intervening zeros, have opposite polarity. The pulses 
become attenuated and distorted in transmission along the cable pair 
and are reconstructed by repeaters located at intervals of nominally 6000 
feet along the line. An “error” is an incorrect reconstruction In any one 
pulse position: a pulse where originally no pulse was sent, or a blank 
where a pulse should be. 

If one looks only at the presence or absence of pulses—the binary ones 
and zeros—errors cannot be detected unless one knows what was sent. 
But by looking at pulse polarity, one can detect violations of the bipolar 
format: the occurrence of two consecutive pulses (with or without in- 
tervening blank spaces) having the same polarity. Any single isolated 
error—either the omission or insertion of one pulse—always results in 
exactly one “bipolar violation” (BPV). On the other hand, multiple errors 
can combine so that a bipolar violation does not occur, while the reversal 
of the polarity of a pulse would create two bipolar violations without any 
errors, and, in any case, the occurrence of a bipolar violation does not 
define precisely where an error occurred or whether it was an insertion 
or deletion. In practice, however, the rate of occurrence of bipolar vio- 
lations is generally close to the error rate. Both are quoted as numbers 
representing the ratio of the number of events (errors or bipolar viola- 
tions) to the number of bits transmitted. 
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Another measure of performance, “percent error-free seconds,” is used 
to specify objectives for the Digital Data System.® To define percent 
error-free seconds, the measurement period is divided into 1-second 
intervals from an arbitrary starting point, and the 1-second intervals in 
which errors occur—‘“error seconds’”—are counted. In practice, this 
count is estimated by counting the intervals in which bipolar violations 
occur. The remaining intervals are “error-free seconds.” 

The surveys reported here were based on measurement of lines in 
service, carrying pulse streams that were unknown to us (except that they 
obeyed the constraints imposed by the D1 channel bank format). 
Therefore, although the terms “error rate” and “error-free seconds” are 
loosely used in referring to the results of the survey, all the survey results 
and measurements are actually in terms of bipolar violations rather than 
errors. 

On the other hand, the detailed recordings of selected lines were 
originally planned to record true errors on T'1 lines removed from service 
and carrying a known pseudorandom bit stream. As the program de- 
veloped we also included recordings of bipolar violations on T1 lines 
carrying unknown pulse streams from D1 channel banks. Therefore, in 
referring to the detailed recordings, “errors” and “bipolar violations” 
are not the same; these terms will be used more strictly in that context 
than in referring to the survey results. 


lil. THE SURVEY OF ERROR RATE AND ERROR-FREE SECONDS 
3.1 Survey procedure 


The scope of the survey is summarized in Table I. The activity at the 
first office visited was a pilot run for the rest of the program. The survey 
activity at this office consisted of 1-minute error counts using simple 
bipolar violation counters. After the pilot project a device was built to 
count bipolar violations for 54-minute intervals on 16 lines at a time, in 
conjunction with a PDP-11/20 computer (which was used primarily for 
the jitter measurements and the detailed error recordings). This device 
was used in the remaining four offices. After the survey at the second 
office, the software was modified to obtain counts of error seconds con- 
currently with the error counts. Error-second counts were thus made 
at the last three offices surveyed. 

Since the original purpose of the survey at each office was to select T1 
lines for extended error tests, general information about the population 
of lines surveyed was not recorded. Some information, however, is 
available about each office as a whole. Thus, for example, it is known that 
the lines were all two-cable at office 3, all one-cable at office 5, and mixed 
at the other offices. (A two-cable T1 system is one in which opposite 
directions of transmission are segregated in separate cables.) 
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Table | — Scope of the survey 


Survey Number 
Office dates of lines Data recorded 
1 3/1/73 to 461 1-min BPV 
4/1/73 
2 9/20/73 to 493 54-min BPV 
9/27/73 
3 10/31/73 to 561 54-min BPV and 
11/9/73 error-seconds 
4 11/29/73 to 7192 54-min BPV and 
1/9/74 error-seconds 
5 1/21/74 to 287 54-min BPV and 
1/24/74 error-seconds 
Total 2594 


The survey was not based on random sampling. Measurements began 
at one end of the office repeater bay lineup and continued either until 
the other end of the lineup was reached (at offices 3, 4, and 5) or until 
the expiration of the time allotted for activity at that office (at offices 
1 and 2). Lines that were out of service for any reason were skipped over 
and excluded from the survey. Measurements for the survey were made 
only on business days between the hours of 8:00 a.m. and 6:00 p.m. 

At each office repeater included in the survey, a bipolar violation de- 
tector was plugged into the receive monitor jack to detect errors in the 
pulse stream from the distant office. Each office repeater was counted 
as aseparate T1 line. A through system from one distant office to another 
via the office surveyed was thus effectively split into two parts and 
measured as though it consisted of two systems, each connecting the 
survey office with one of the distant offices. One consequence is that 
there are more short T1 lines (especially single-span lines) in the survey 
than in the plant. 

Error rate measurements at office 1 were made for 1-minute intervals 
on one line at a time using one of two different instruments. One was a 
Philco-Sierra 314A T1 error detecting set, which has a built-in counter 
with a maximum counting rate of 10 per second. The other was a Western 
Electric J98710G-2 L3 error detecting set connected to a General Radio 
1192 counter, which counted bipolar violations as fast as they could occur 
on the T1 line. 

At the other offices, errors were counted by a 16-line bipolar violation 
detector using a PDP-11/20 computer. The counting program had two 
versions, as indicated previously. Version 1, used only at office 2, counted 
only errors. Version 2, used at the last three offices surveyed, counted 
error seconds as well as errors. The counting rate of this equipment was 
subject to the following limitations. 


(i) (Version 2 only)—If the count in any one second exceeded 1544 
errors on all 16 lines combined (corresponding to 107? error rate on any 
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one line) the counting program was terminated, so that its use of com- 
puter time would not interfere with concurrent detailed recording or 
jitter measurement on another line. 

(it) Maximum short-term (fractional-second) counting rate on all 
lines combined was about 10,000 per second, because the computer took 
about 95 us to process each error counted (about 60 us in Version 1). 

(iit) Resolution varied from normally about 30 us to several hundred 
microseconds depending on error activity. Errors closer together than 
this on the same T1 line would be counted as a single error. 


3.2 Survey data processing 


~The survey data was recorded manually at the test site and later 
transcribed to punched cards. The survey data consisted of groups of 
up to 16 lines, monitored for 54 minutes per group, except at office 1 
where each line was monitored for 1 minute. Twenty lines that lost signal 
at some time during the 54-minute test were eliminated. Error counts 
were converted directly to error rates by dividing by the number of T1 
line bits expected in the monitoring interval at the nominal line rate. 
Computer programs were used to obtain statistical summaries of three 
quantities for each line (where available): the percent of seconds with 
error, the error rate, and a clustering factor. These are all represented 
graphically by cumulative distribution functions in Figs. 1 to 6. 

The clustering factor is not simply the ratio of the number of errors 
to the number of error seconds. This would give large overestimates of 
clustering at high error rates (above about 10~®) because the number of 
error seconds in 54 minutes is bounded. The calculated clustering factor 
therefore compares the actual count of error seconds with the number 
of error seconds that would be expected if the errors counted had oc- 
curred randomly (Poisson process model). This factor gives the same 
result at low error rates, but approaches unity at high error rates. 

For a 54-minute test, the error rate (strictly, the bipolar violation rate) 
is 
bipolar violation count 

1.544 X 10° X 60 X 54 


The fraction of seconds that contain errors is 


ER = 


_ error second count 
60 X 54 


The percent error-free seconds is simply 


PEFS = 100 X (1 — ESF) 


ESF 


Given an error rate ER, a Poisson process model could be used to predict 
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an error second fraction 
ESF’ = 1 — exp(—1.544 x 10° X ER) 


The clustering factor is defined as 
_ ESF’ 
ESF 


The graphs were arranged so that the horizontal axis consisted of a 
logarithmic scale and the vertical axis consisted of a normal probability 
scale; thus, a lognormal distribution would be plotted as a straight line. 
Percent of seconds with error and BPV rate graphs were based on total 
population, i.e., lines with and without errors. The lowest point plotted 
corresponds to the percent of lines that had either no errors or one error. 
Clustering factor graphs were based solely on T1 lines with errors. To 
avoid step functions appearing on the graphs, a continuous curve was 
created by plotting only the upper “corners” of the step function. 


CF 





3.3 Results of the survey 


Figure 1 shows the distribution of error rate at each office. Each point 
on a given distribution can be interpreted as a statement that some 
percent of the lines had error rates equal to or less than (better than) a 
certain error rate. For example, about 98.8 percent of lines at office 2 had 
error rates equal to or better than 10-6. Because of the counting rate 
limitations described previously, the true error rate may be somewhat 
higher than the measured error rate for some lines. On the average, it 
is probably not more than twice the measured error rate, based on esti- 
mates of clustering from an earlier survey® as well as results from our 
detailed error recordings. Thus, for the same example, we would estimate 
that at office 2 less than 98.8 percent of lines had better than 10-6 error 
rate, but probably more than 98.8 percent had better than 2 X 10-6 error 
rate. That is, the true error rate curve might be farther to the right, by 
less than a factor of 2 in error rate. 

Figure 2 shows the aggregate distributions of error rate compared with 
previous surveys. All the surveys refer to lines terminated predominantly 
or entirely at D1 channel banks. Two curves are shown for our survey 
because measurements of error rates below 1078 are not available for 
office 1. However, the two curves are very close together. 

The previous surveys had suggested a possibility that 'T1 error per- 
formance was changing. The latter of these surveys,® in 1971, showed 
error rates about 10 times as high as the earlier survey,’ in 1966. A survey 
at one site in 1968 showed a distribution (not shown in Fig. 2) lying be- 
tween the other two.® However, the results of our survey show such large 
differences between offices that the difference between the 1966 and 1971 
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PERCENT OF LINES BETTER 





10-10 10-9 10-8 10-7 107 & 10-5 10-4 
ERROR RATE 


Fig. 1—Error rate distributions for individual offices. 


surveys cannot be said to show a significant trend over time. Qualitative 
observations suggest that the important difference between offices is 
in maintenance organization and effort, which tends for practical reasons 
to be correlated with (although not fully determined by) both office size 
and T-carrier network size. 

Offices 2, 3, and 4 in our survey had error rate distributions very close 
to the 1966 survey. Even allowing for a possible underestimate of a factor 
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Fig. 2—Aggregate error rate distributions. Results of previous surveys are shown for 
comparison. 
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Fig. 3—Distributions of percent seconds with error, compared with the aggregate of 
the previous survey. 


of 2, they are still much better than the 1971 error rate distribution, and 
much closer to the 1966 curve than to the 1971 curve. These three offices 
are all large offices in large metropolitan networks, as were the offices 
in the 1966 survey. In the 1971 survey, on the other hand, two of the three 
offices were suburban offices. 

The office showing the poorest distribution in our survey was also the 
smallest. This building housed two step-by-step switching machines, 
and a combined maintenance force was responsible for both switches 
and T-carrier equipment. This distribution was noticeably affected by 
the existence of high error rates (between 10-6 and 10~*) on 12 lines in 
the same cable (4 percent of the sample), apparently due to a single cause; 
omitting these lines would have moved the curve closer to the others. 

Error-free seconds results are plotted in Fig. 3 in terms of percent 
seconds with error (which is 100 minus percent error-free seconds), so 
that the distribution curves are similar to the error rate distributions. 
These results are not affected by the counting-rate limitations of the 
measurement apparatus. The best error-seconds distribution in our 
survey (office 3) is better than the aggregate of the 1971 survey for about 
a factor of 4 in percent seconds with error (see Fig. 3); our aggregate is 
better by a factor of 2. The scatter plot in Fig. 4 indicates that most lines 
had about the same number of errors as seconds with error (except at 
high error rates, where the number of seconds with error approaches the 
number of seconds in 54 minutes). However, clustering can occur at all 
levels. Figs. 5 and 6 show the distribution of clustering factor (calculated 


718 THE BELL SYSTEM TECHNICAL JOURNAL, MARCH 1978 


A OFFICE 3 
© OFFICE 4 


O OFFICE 5 


PERCENT OF SECONDS WITH ERROR 





10-19 10-9 10-8 1077 1076 10-5 10-4 
BIPOLAR-VIOLATION RATE 


Fig. 4—Scatter plot of percent seconds with error vs. error rate. 


so as to avoid false indications of clustering at high error rates, as de- 
scribed in the preceding section). Table IT shows the mean and standard 
deviation of the clustering factor for each office and for the aggregate. 
In general the standard deviation is much larger than the mean, except 
at office 3. Overall, the mean is about two events per second-with- 
error. 
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Fig. 5—Distributions of clustering factor for individual offices. 
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Fig. 6—Aggregate distribution of clustering factor. 


Table Il — Statistics of clustering factor 
Standard 
Office Mean deviation 
3 1.20 0.98 
4 3.13 22.9 
5 1.70 4.93 
Aggregate 2.26 16.5 


The difference between clustering factors at different offices should 
probably not be regarded as significant. The unusual statistics for office 
4 are due to just four lines with unusually high clustering factors (1 
percent of the lines with errors). At office 5, the sample was small, and 
it is known that several lines could be disturbed by the same condition 
(although the 12 lines with high error rate in the same cable, mentioned 
earlier, did not have high clustering factors). 


IV. DETAILED RECORDINGS OF ERRORS AND BIPOLAR VIOLATIONS 


4.1 Experimental procedure 


This phase of the measurement program was designed to obtain 
complete and continuous recordings of errors on T1 lines for periods of 
24 hours (or over a weekend), recording at what time, and on which bit, 
each individual error occurred. We made no effort to find or fix the cause 
of the errors. 

As originally conceived, the basic procedure was to remove a line from 
service and apply a test signal with the line looped back to the test site, 
so that the received signal could be directly compared with the trans- 
mitted signal. The test apparatus is shown schematically in Fig. 7. The 
test signal was a pseudorandom linear shift-register sequence? with a 
period of 1,048,575 bits, generated by a 20-bit shift register, as shown 
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Fig. 7—Error measurement system block diagram. 


in Fig. 8. The bit stream returned via the looped T1 line was compared, 
bit by bit, with the output of a similar sequence generator to detect errors 
on the line. The occurrence of each error was recorded on magnetic tape. 
A Digital Equipment Corporation PDP-11 computer controlled test se- 
quencing and formatted the output to the tape. 

In most cases the entire T1 line was not removed from service. A T1 
line from one office to another generally passes through several inter- 
mediate offices, which divide it into spans of the order of 5 miles long. 
Spare spans are normally provided. The least troublesome procedure 





















9 10 11 12 13 14 #165 16 17 18 19 20 





...11010011010010010010010010010000000000000000000100000... 


Fig. 8—Pseudorandom sequence generator. The digits in the shift register move from 
left to right, and hence appear in the output as read from right to left. Generation of the 
first of the string of 19 zeros is illustrated. 
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to obtain a length of T1 line for testing is to patch service on one span 
onto a spare. This procedure requires patching only at the test site and 
one adjacent office, and does not involve any other offices. 

In the earliest measurements at office 1, lines were chosen at random 
to record both jitter and errors. Since most of the lines so chosen did not 
make any errors, the bipolar violation survey was introduced as a means 
of selecting lines for error test that were known to make errors, con- 
tinuing to use random selection for the jitter measurements. We reasoned 
that if bipolar violations were observed in the pulse stream coming from 
the D1 channel bank at the distant terminal office, errors must have been 
occurring somewhere on the line, and there was a reasonable probability 
that these errors would be in the span adjacent to the test site. 

This procedure resulted in a reasonable yield of error observations 
at office 1 and office 2. However, it was not successful at office 3. In an 
effort to improve the yield at this office, lines were chosen that termi- 
nated at the next office, so that looping one span would necessarily in- 
clude the part of the line where the bipolar violations originated. But 
when these lines were looped and the pseudorandom test signal was 
applied, errors either did not occur at all, or occurred only occasionally, 
at rates far lower than the BPV rates observed in service in the survey. 
Such cases had been observed occasionally at offices 1 and 2. In addition, 
at office 4, we were able to loop T1 lines back to the test site at distant 
terminal offices, and again we observed that most lines that showed bi- 
polar violations when in service did not make errors when carrying the 
pseudorandom test stream. In most cases we verified that the bipolar 
violations did not originate in the channel bank. 

The error recording equipment was therefore modified, late in the 
measurement program, so as to be able to record bipolar violations in 
the same manner as errors. The procedure was modified to include 
pretests that would indicate systematically whether the line should be 
tested using bit generators and the error detector to record errors (“error 
run”), as shown in Fig. 9a, or restored to service and tested as in Fig. 9b, 
to record bipolar violations in service (“BPV run”). 

The modified procedure included two pretests designed to investigate 
the effect of the signal on the error performance, using the configuration 
shown in Fig. 9c. The T1 span was out of service and looped at the next 
office as in Fig. 9a, but use was made of the signal from the D1 channel 
bank, which was now routed to its destination via the spare. The dashed 
lines show alternative connections. One pretest recorded BPVs on the 
spare. In the next pretest, a signal tapped from the spare was fed into 
the looped span, and BPVs were recorded on the signal coming back from 
the looped span. The significant result was that in many cases, BPVs were 
not seen on the spare, but were seen on the signal that came from the 
spare via the looped span, although errors did not occur when the signal 
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Fig. 9—Error and BPV test configurations: (a) error run, (b) BPV run, (c) BPV pre- 
test. 


from the bit generator was transmitted on the looped span (as in Fig. 9a). 
That is, errors would, or would not, occur on the looped span, depending 
on the signal fed into it. 

Each continuous recording of errors or bipolar violations on a par- 
ticular line or span is referred to as a “run.”’ Normally the duration of 
arun is about 24 hours. Longer runs occurred when a run was successfully 
started on a Friday, since in that case the run was allowed to continue 
over the weekend. Shorter runs occurred when a line under test turned 
out to have few or no errors in a period of a few hours, or had an abnor- 
mally high error rate; the operator would then terminate the run and 
select another line for test. Each run was recorded as a distinct file on 
magnetic tape. 

A plot of error rate versus time was derived for each run as a whole. 
For further analysis, “samples” of up to 1600 consecutive errors were 
extracted. At low error rates, a “sample” could include a whole 24-hour 
run. Where possible, separate samples were taken when the plot of error 
rate versus time showed different conditions occurring at different times. 
These samples were analyzed to study the “error-free interval” distri- 
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Fig. 10—Normal diurnal variation of error rate, recorded over a weekend, in an error 
run. 


butions (distributions of intervals between successive errors) and the 
dependence of error probability on the content of the bit stream. 


4.2 Diurnal variation of error rate 


Since most of the error and BPV runs lasted about 24 hours, diurnal 
variation of error rate could be observed, but could not be entirely sep- 
arated from long-term and short-term variation. Even when a run lasted 
over a weekend, diurnal components cannot be fully separated because 
a weekend does not consist of identical diurnal cycles. However, some 
distinct types of diurnal variation were identified. 

In roughly one-third of the error and BPV runs of 1 day or longer, ah 
error rate had a broad maximum during the working day (with short- 
term variations superimposed), falling off gradually during the evening 
to a-‘minimum error rate at about 4:00 a.m., and rising again to its 
workday level about 8:00 a.m. Figure 10 shows such a pattern observed 
over a weekend. We refer to this pattern as “normal” diurnal varia- 
tion. . 

Most of the remaining error runs,.and a few of the BPV runs, showed 
essentially steady error rates over the day, as in Fig. 11. Irregular vari- 
ations, on the other hand, were observed in most of the remaining BPV 
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Fig. 11—Steady error rate, recorded over a weekend, in a BPV run. Each plotted point 
is computed from the number of errors ina cell about 5 minutes long, containing an average 
of about 5 errors; hence the point-to-point variation is entirely accounted for by assuming 
errors occurring independently at random (Poisson process), except at the very end of the 
run. 


runs, and a few error runs. Fig. 12 shows distinctly irregular variation, 
with large nonperiodic changes. However, any variation that did not fit 
a recognizable pattern was classified as irregular. 

The predominance of irregular variation in the BPV runs (as contrasted 
with steady error rates predominantly in the error runs) may be due to 
the variability in the content of the bit stream from the channel bank. 
As described in a later section, the lines with steady or irregularly varying 
error rates tended to be sensitive to the bit pattern on the lines. The 
invariably repeating pseudorandom sequence in the error runs would 
tend in such cases to give error rates that were constant with time; a pulse 
stream with variable content, such as the channel bank output, would 
result in varying error rates. 

Two T1 lines (one BPV run, one error run) showed two distinct error 
rates during the diurnal cycle, with abrupt transitions from one error 
rate to the other, as in Fig. 13. This pattern has been classified as “‘ex- 
aggerated” diurnal variation. 

Table III lists the total number of error and BPV runs with each type 
of diurnal variation. Only runs that actually extended overnight are 
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Fig. 12.—Irregular variation of error rate, recorded for 24 hours, in a BPV run. 
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Fig. 13—Exaggerated diurnal variation, recorded for 24 hours, in a BPV run. 
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Table Ill — Frequency of occurrence of different types of diurnal 


variation 
Variation Error-run BPV-run Total 
type lines lines lines 
Normal 6 4 10 
Steady 6 1 7 
Irregular 3 5 8 
Exaggerated 1 1 2 
Total 16 11 27 


shown; shorter runs are excluded because the error rate minimum near 
4:00 a.m. was an essential feature for identification of the “normal” 
type. 

In the diurnal variation curves in Figs. 10 through 13, each point 
plotted represents the average error rate in a 5-minute cell. Since the 
cell boundaries were defined by the starting times of the data records 
on the tape, which occurred irregularly, the cells are not exactly 5 minutes 
long. However, the error rate for each cell is correct, being determined 
by dividing the number of errors by the actual number of bits in the 
cell. 


4.3 Distribution of intervals between errors 


Distributions of “error-free intervals” (the intervals between suc- 
cessive errors, in time units equal to the reciprocal of the bit rate) were 
plotted for every error sample. Three principal types were observed, 
identified as unimodal, bimodal, and trimodal. The same types were 
observed in both BPV runs and error runs. Table IV shows the number 
of lines observed with each type of distribution. 

A typical unimodal distribution is shown in Fig. 14. This curve may 
be interpreted (except for the numbers on the vertical axis) as the 
probability density function of the logarithm of the interval, considering 
the interval as a continuous random variable. The curve was actually 
derived by setting up a logarithmic interval axis, dividing it uniformly 
into 40 cells per decade, and plotting the number of intervals that fell 


Table IV — Frequency of occurrence of different types of 
distribution of the intervals between successive errors or BPVS 


Distribution Error-run BPV-run Total 

type lines lines lines 
Unimodal 2 4 6 
Nearly unimodal 6 5 11 
Bimodal 5 2 7 
Trimodal 6 3 9 
Total 19 14 33 
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Fig. 14—Unimodal distribution of “error-free intervals,” observed in a BPV run. 


in each cell. The presence of only one mode indicates the absence of 
clustering. The asterisk just above the horizontal axis, in the vicinity of 
the mode, is plotted at the reciprocal of the mean error rate, that is, at 
the mean interval between errors. 

Also shown in Fig. 14, as a dashed curve, is the probability density 
function of an exponential distribution with the same mean, scaled to 
fit the same logarithmic horizontal axis and to represent the same total 
number of errors on the vertical axis. The apparent fit suggests that the 
errors occurred independently (that is, as a Poisson process). As will be 
shown later, errors on this line were not independent, but depended 
strongly on the immediate pattern of bits on the line. However, there 
is a clear indication of large-scale independence: error-sensitive patterns 
occurred regularly, but the occurrence of an actual error in each pattern 
was independent of previous occurrences. 

Fig. 15 shows a phenomenon that (fortunately for service) did not 
happen very often, or last very long: a very high error rate. The distri- 
bution shows a very low mean interval between errors. For the shortest 
intervals, the number of occurrences of each discrete interval are resolved 
in the plot. Clearly the errors are not completely independent (as in 
Bernoulli trials), because the count would then be highest at 1 and would 
decrease with increasing interval length. But the independent-error 


728 THE BELL SYSTEM TECHNICAL JOURNAL, MARCH 1978 


COUNT OF EFI’'S OF LENGTH X 





1071 1 10 102, «-102)—s 104 ~—:108 
ERROR FREE INTERVAL, BITS 


Fig. 15—Distribution of “error-free intervals” at very high error rate, observed in an 
8 g 
error run. 


model is conceptually useful as a point of departure. These distributions 
are not included in Table IV. 

The bimodal distribution shown in Fig. 16 indicates error clustering. 
The mode at the right represents the intervals between clusters, and its 
shape again suggests large-scale independence (Poisson process) as in 
Fig. 14, but in this case also the occurrence of errors actually depended 
on local bit patterns. (The small amplitude of this mode may be decep- 
tive; it actually includes about one-fifth of the total number of intervals.) 
The mode at the left represents the intervals between errors within a 
cluster, and, as in Fig. 15, errors are not quite independent within clus- 
ters. However, the general appearance is not very different from what 
one would expect from a combination of two exponential waiting-time 
distributions. 

A substantial number of lines had interval distributions intermediate 
between the unimodal and bimodal types. In these lines, most of the 
intervals were grouped around a single mode at the right, with less than 
10 percent of the intervals (sometimes only one) defining another mode 
at the left. These distributions are referred to as “nearly unimodal.” 

The trimodal distribution in Fig. 17 indicates two levels of clustering, 
that is, clusters within superclusters. The mode at the right represents 
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Fig. 16—Bimodal distribution of “error-free intervals,” observed in an error run. 


intervals between superclusters, the mode in the middle represents in- 
tervals between clusters within superclusters, and the mode on the left 
represents intervals between errors in a cluster. Actually, in this termi- 
’ nology, a “cluster” or ‘“supercluster” could consist of a single error, since 
the average clustering factor was not large (usually about two to three 
errors per supercluster) and errors often occurred in isolation. On the 
other hand, it was not uncommon to find superclusters containing as 
many as 30 errors, occurring both alone and in clusters within the same 
supercluster. | 

Whenever a trimodal distribution was observed in an overnight run 
(permitting identification of the diurnal variation type), the line showed 
“normal” diurnal variation. Conversely, most of the “normal” diurnal 
variation runs showed trimodal interval distributions. In these cases, 
the interval distribution remained trimodal throughout the diurnal cycle. 
The middle mode, which indicates structure within the supercluster, 
remained usually in the same place, extending from about 100 to 500 bits, 
while the right mode, which described the occurrence of superclusters, 
moved left or right as the error rate on the line went up or down (re- 
spectively) with time of day. 

A number of distributions classed as bimodal (or nearly unimodal) 
differed in appearance from Fig. 16 because of other features. In two 
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Fig. 17—Trimodal distribution of “error-free intervals,” observed in a BPV run. 


runs, discussed in Section 4.5, only certain values of short interval oc- 
curred. In a BPV pretest for an error run, multiples of about 193 (the D1 
frame duration) predominated in the long intervals. In another sample, 
representing a burst of relatively high error rate in a BPV run, the long 
intervals are almost all multiples of about 20,000 bits (445 second). In 
most of these cases, longer and shorter intervals are interspersed, indi- 
cating error clustering. On the other hand, we obtained a sample from 
one error run that looks much like Fig. 11, but actually represents a 
transition from a low error rate, as in Fig. 14, to a very high error rate, 
as in Fig. 15; all the long intervals came first, followed by the short 
ones. 


4.4 Effects of local bit patterns 


On most lines, the probability of error depended on the local bit pat- 
tern being transmitted. The probability of error on any given line was 
usually not the same for ones as for zeros (bias effect), and also depended 
on the values of preceding and following bits (intersymbol interference 
effect). The pattern sensitivity varied greatly from line to line in both 
error and BPV runs. Pattern sensitivity was categorized as high, medium, 
or low for each line; Table V shows the number of lines observed in each 
category. 
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Table V — Frequency of occurrence of different levels of pattern 
sensitivity of probability of error or BPV 


Pattern Error-run BPV-run Total 

sensitivity lines lines lines 
High (250.0) 7 3 10 
Medium 0 8 8 
Low (<4.0) 7 3 10 
Totals 14 14 28 


Pattern sensitivity was directly observable in the error runs (except 
at office 2) because the bit stream on the line was a known pseudorandom 
sequence. (At office 2, a broken wire forced the second bit of every 16-bit 
data word on every tape to zero; the evaluation of error rate, diurnal 
variation, and the interval distribution was not seriously affected by the 
consequent loss of data, but pattern dependence could not be determined 
because identification of bit positions in the sequence was ambiguous.) 
In the error runs, pattern sensitivity fell clearly into two groups: very 
high and very low (except for short periods of very high error rate, which 
showed no discernible pattern sensitivity at all). In the BPV runs, evi- 
dence of pattern sensitivity appeared as a consequence of the periodic 
structure of the D1 channel bank frame. The bit stream is organized into 
frames of 193 bits at 8000 frames per second, in which one bit (the 
framing bit) is alternately one and zero in successive frames to enable 
the receiving channel bank to determine the frame phase. The remaining 
192 bits consist of 24 words of 8 bits each, one word for each channel, in 
which one bit carries signaling and the other seven represent the voice 
by pulse code modulation (PCM). Pattern dependence could be inferred 
by relating the occurrence of bipolar violations to the periodicities of the 
D1 channel bank frame, and indications of pattern dependence pre- 
sumably depended on the content of the channel bank output. 

In the error runs, bias effects were characterized by identifying each 
error as an insertion (error in a zero) or a deletion (error in a one) and 
computing the percent of errors that were deletions. The percent dele- 
tions observed on 14 T1 lines varied from 0 to 100 percent, in a manner 
consistent with a uniform distribution. 

Preliminary analysis suggested that intersymbol interference effects 
were usually confined to the bit following the error and the 6 bits pre- 
ceding it. These bits, together with the bit in error, comprise an 8-bit 
string with the error in the seventh bit. For each error sample, a tally was 
then made of the number of times each of the 256 possible 8-bit strings 
occurred with an error in the seventh bit. Figs. 18 and 19 show the results 
of two such tallies, representing examples of low and high sensitivity 
respectively. (In these figures, the seventh bit is shown as transmitted; 
the slash through it indicates that this was not the value received.) 
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As a numerical measure of pattern sensitivity, the parameter 
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was computed for each error sample, where N; is the number of times 
the ith 8-bit pattern occurred with an error in its seventh bit, and N is 
the total number of errors. This parameter is related to chi-square as it 
would be evaluated to test the null hypothesis of equal probabilities for 
all 256 patterns. However, while chi-square is useful primarily as a 
measure of statistical significance, D is designed to be nearly unaffected 
by the sample size for strongly pattern dependent errors, so that it 
measures pattern sensitivity as a property of the T1 line. The largest 
possible value of D, which would be attained if all errors occurred in the 
same pattern, is nearly 255 (actually, 255 — 255/N). If errors were in- 
dependent of pattern, so that all 256 patterns were equally likely, D 
would have zero mean (based on a binomial distribution, p = N/256, for 
N;) and a standard deviation of approximately 22.6/N (based on the 
related chi-square distribution, valid for large N, and verified by enu- 
meration as roughly correct for N = 2 and N = 38). 

The values of D fell into two groups: high (50 or above) and low (2.2 
or below). The only exception is one sample that has an intermediate 
value because it spans a transition from one type to the other. However, 
in almost all cases the value of D is much larger than 22.6/N, and hence 
significantly different from zero. The exceptions are of two types: small 
samples, and very high error rates (above 10~?). Pattern sensitivity 
showed no consistent relation to percent deletions. 

In the BPV runs, the bit patterns on the line were unknown. However, 
it could be presumed that some bit patterns might tend to recur peri- 
odically either at the 193-bit frame period or at the 8-bit word period 
within the frame. Accordingly, in each BPV sample each bit was assigned 
a frame position from 1 to 193, starting arbitrarily at the beginning of 
the sample, and a tally was made of the number of bipolar violations 
found in each of the 193 positions. These tallies were analyzed both 
numerically and graphically. 

As a general numerical measure of pattern sensitivity for BPV runs, 
the parameter 


- 193 198 No- 192 a 

N? j=1 N 
was computed for each BPV sample, where N; is the number of BPVs that 
occurred in the ith position in the 193-bit frame. This parameter has 
similar properties to the parameter D computed for the error samples. 
Specifically, its largest possible value is 192(1 — 1/N), and its standard 
deviation, for random errors, is 19.6/N. 
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Fig. 18—Computer printout of occurrences of errors within 8-bit patterns showing low pattern sensitivity in an error run (D = 0.5). 
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Fig. 19—Like Fig. 18, but showing high pattern sensitivity (D = 107.5). 


The values of D’ are significantly different from zero in most cases, 
and vary as widely as the values of D for the error samples. However, they 
do not fall clearly into two distinct groups; about half fell into the ““me- 
dium” range, between 2.2 and 50, in which D never fell. This may be 
explained by the fact that dependence of BPV rate on frame position 
depends not only on the error mechanism on the T1 line but also on the 
properties of the transmitted bit stream. The presence of frame peri- 
odicity in the bipolar violations actually depends on two factors: the 
probability of a bipolar violation must depend on the bit pattern, and 
recurrent patterns must exist in the D1 bank frame. 

It should be noted that bipolar violations are necessarily pattern- 
dependent, even if the errors causing them are not, because a bipolar 
violation can be detected only when a one is received. In the D1 channel 
bank format, some frame positions are more likely to contain ones than 
others; hence, bipolar violations will always be frame-position dependent 
to some extent. This factor may account for some of the observed 
frame-position dependence where this dependence is weak. However, 
it cannot account for the observed cases of strong frame-position de- 
pendence, because there are constraints in the channel bank that prevent 
long strings of zeros (as described in Section 4.5). 

Figures 20 through 22 respectively show examples of weak (D’ = 0.4), 
medium (D’ = 11.7), and strong (D’ = 108.5) dependence of bipolar 
violations on bit position in the frame. These figures represent 3-di- 
mensional bar graphs, the height of each bar representing the number 
of bipolar violations at each frame position. The lower left corner rep- 
resents the first bit in the frame, the first 8 bits are laid out from left to 
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Fig. 20—Three-dimensional bar graph showing weak dependence of bipolar violation 
probability on position in the D1 bank frame in a BPV run (D’ = 0.4). Frame alignment 
is hypothetical. 
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right, successive 8-bit words are laid out behind the first, and the 193rd 
bit is in the far left corner of the base of the diagram. Each row running 
from front to back thus represents a given bit position in each word. In 
these figures, the hypothetical starting point of the frame was shifted 
to the position that gave the figure the most regular appearance (by 
maximizing a parameter similar to D’, but based on the number of bi- 
polar violations in each position in the word), on the presumption that 
such a choice probably approximated alignment with the actual channel 
bank frame being transmitted. 

Figure 20 shows an example of relatively weak position dependence; 
bipolar violations occur in all positions, but their probability clearly 
depends on the position of the bit in the word. An example of medium 
position dependence is shown in Fig. 21. Fig. 22, where most of the bi- 
polar violations occurred in one position in the frame, shows strong po- 
sition dependence. Fig. 22 is an extreme case; the same line at another 
time of day showed an intermediate pattern in which no single bit po- 
sition had a majority of the BPvVs. Figure 22 necessarily indicates high 
sensitivity to some recurrent pattern. Figures 20 and 21, however, could 
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Fig. 21—Like Fig. 20, but showing medium position dependence (D’ = 11.7). 
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Fig. 22—Like Fig. 20, but showing strong position dependence (D’ = 108.5). 
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be explained either by lower pattern sensitivity (possibly none at all), 
or by high sensitivity to patterns that occur less regularly. Correlation 
with other properties of the error process, discussed in Section 4.6, 
suggests that the former explanation is applicable to cases such as Fig. 
20, and the latter to cases such as Fig. 21; but such conjectures are 
speculative. 


4.5 Effects of density of ones in the bit stream 


The pseudorandom sequence does not meet the constraints on mini- 
mum ones density for bit streams transmitted on T1 lines. These con- 
straints are intended to ensure that there is always sufficient energy in 
the timing tank circuit in each T1 line repeater. This tank circuit gets 
an input pulse whenever a one is received, and if the density of ones is 
too low the repeater timing becomes inaccurate and errors are more likely 
to occur. 

It was, therefore, expected that any given T1 line, driven by the 
pseudorandom sequence generator, would be more prone to make errors 
than when driven by a D1 channel bank, which does meet these con- 
straints. This expectation was not fulfilled. On different lines, the error 
rate for the pseudorandom sequence might be greater than, equal to, or 
less than the error rate for the D1 bank output. In most cases, errors 
occurred only when the signal source was a channel bank; sometimes, 
however, errors occurred only when the signal was supplied by the 
pseudorandom generator. This does not necessarily mean that T1 lines 
in general have higher error probability when driven by a D1 channel 
bank, because lines were usually selected for test on the basis of errors 
observed with a channel bank as the signal source, so that the sample 
of lines is biased. 

In addition, it was expected that errors would be more likely to occur 
in those parts of the pseudorandom sequence where the constraints were 
most severely violated. As described below, two of the lines tested showed 
this tendency to a remarkable degree, but the rest showed no such ten- 
dency at all. 

The pseudorandom sequence differs from the output of a channel 
bank both in maximum length of strings of zeros and in average ones 
density. The D1 channel bank must have a one in each word; the longest 
string of zeros that it can generate is 15. The pseudorandom sequence, 
on the other hand, contains eight strings of more than 15 zeros, including 
one of 19 zeros, in each repeated frame of a million bits (actually 
1,048,575 bits, about 0.68 seconds). 

The average ones density is nominally one-half for both, but in the 
channel bank output it varies from one bank to another, while in the 
pseudorandom sequence it varies over the million-bit frame. In the D1 
channel bank, the ones density depends on how the analog-to-digital 
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Fig. 23—Effective ones density for a timing tank with Q = 60, as a function of bit position 
in the pseudorandom sequence, in the vicinity of the string of 19 zeros, showing the location 
of the lowest effective density of ones in the sequence. 


encoders (of which there are two in each bank) are adjusted to encode 
the zero level. The intended zero code is 1000000, which leads to a low 
ones density. However, with a slight misadjustment, zero might be en- 
coded as the next lower code level, 0111111, which leads to a high ones 
density. The ones density is also affected by the signaling bit (a one for 
the on-hook condition) and by the encoding of noise and speech. One- 
second measurements on D1 channel banks in service have shown ones 
densities ranging from about one-fourth to three-fourths, with little 
variation over the day for any given bank.!° Other observations have 
shown ones densities as low as about one-eighth on T1 lines in ser- 
vice.!! 

In the pseudorandom sequence, the average ones density over the 
million-bit frame is almost exactly one-half (actually 524,288/1,048,575). 
The variation of ones density within the frame can be evaluated as a 
weighted average of the past bits, using exponential weighting with a 
time constant of Q/z bits (about 32 bits for the typical timing-tank Q 
of 100). This average is theoretically proportional to the amplitude of 
ringing in a repeater timing tank circuit having the specified Q. 

Figures 23 through 25 show, for three values of Q, the variations of ones 
density following the string of 19 zeros. (Since the ones density increases 
at each one and decreases at each zero, the strings of zeros can be iden- 
tified by their downward slope. The string of 19 zeros starts at the 
211,9938rd bit after the framing bit. The framing bit itself was the last 
in the string of 20 consecutive ones.) The ones density has an absolute 
minimum of about one-fourth in this segment, but the exact value and 
location of the minimum depends on the timing tank Q. 

Only two of the Tl lines that were tested had a noticeable tendency 
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Fig. 24—Like Fig. 23, but for timing tank Q = 100. Note the change in the position of 
the minimum effective ones density as well as the change in minimum value. 


to make errors in this part of the frame. One of these lines had an error 
rate of about 4 X 1078 for the channel bank pulse stream (55-minute 
test), but had a steady error rate of only 3 X 10-9 for the pseudorandom 
sequence. On this line, errors occurred most often on the sixth one after 
the run of 19 zeros, on the first or second zero after that, or the eleventh 
one after the 19-zero sequence. These would be the locations of the lowest 
timing tank amplitudes assuming a Q of about 100, as in Fig. 24. 

The other line, recorded at office 1, was a maintenance spare modified 
(as a stress test) by changing the line buildout (LBO: a circuit installed 
to equalize the loss of repeater sections of different lengths) in the office 
repeater. As installed, with an 836A LBO, the line made no errors. Sub- 
stitutions of 836B through 836E had no apparent effect, an 836F resulted 
in an error rate of 2.5 X 10~6 (several errors per frame), and with an 9386G 
the errors occurred too fast to be recorded. With the 836F LBO, errors 
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Fig. 25—Like Fig. 23, but for timing tank Q = 140. 
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Table VI — Frequency of occurrence of combinations of 
characteristics of errors of BPVs on 22 T1 lines”* 


Distribution Diurnal variation type 
type Normal Steady Irregular Exaggerated 


Unimodal m Hh m h 
Nearly unimodal m HH Lmmm 

Bimodal H m L 
Trimodal LLLLLI 


* The letter indicates run type and pattern sensitivity: capital = error run, lower case 
= BPV run; pattern sensitivity is high (H/h), medium (m), or low (L/l). 


occurred most often on the fourth and sixth ones after the string of 19 
zeros, and sometimes on the two zeros after the seventh one. The fourth 
and sixth ones would just follow the two lowest values of effective ones 
density if the timing tank Q were about 60, as in Fig. 23. 


4.6 Association between properties of the error process 


Table VI suggests that the different characteristics of the error process 
tended to occur in certain typical combinations. The Appendix describes 
how the statistical significance of the apparent tendencies was verified. 
The most consistent combination was normal diurnal variation with a 
trimodal interval distribution (remaining trimodal as the error rate 
varied) and weak pattern dependence. Every line with a trimodal interval 
distribution had normal diurnal variation. Conversely, nearly every line 
with normal diurnal variation had a trimodal interval distribution. Every 
line with a trimodal interval distribution also had weak pattern depen- 
dence. This combination occurred in both error and BPV runs. 

Another frequent combination was steady error rate with strong 
pattern dependence and a unimodal or nearly unimodal interval dis- 
tribution. Every line with a steady error rate had strong pattern de- 
pendence. Every line with strong pattern dependence (except the two 
lines that made errors only at the minimum of ones density in the 
pseudorandom sequence) had a unimodal or nearly unimodal interval 
distribution. 

Other combinations were less consistent. A substantial number of lines 
had irregular variation of error rate, most of them in BPV runs; these lines 
had either unimodal or bimodal interval distributions and usually 
showed an intermediate degree of pattern dependence. Both the irregular 
variation and the ambiguous indication of pattern dependence might 
be explained by the variability of the channel bank output bit stream. 

These observations indicate that at least two different error mecha- 
nisms can be observed. In one, the error rate varies with traffic, but de- 
pends very little on the bit pattern on the line, and two levels of error 
clustering occur as indicated by the trimodal interval distribution. In 
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the other, the probability of error is very much dependent on the pattern 
of bits on the line, the error rate remains steady as long as the same bit 
stream is repetitively transmitted, and there is little or no error clus- 
tering. It might be conjectured that errors are related to switching noise 
in the former type, and to intersymbol interference in the latter. 

Occasional short intervals of very high error rate occurred in several 
lines. Samples during these intervals usually showed measured error 
rates above 10~?, and sometimes above 0.1, with little or no pattern de- 
pendence, and about 50 percent deletions in the error runs. It is clear that 
in these events the errors were quite unaffected by the pattern of bits 
on the line, but the cause is unknown. 

The great difference between the error rate in the pseudorandom se- 
quence and the error rate in the channel bank bit stream is unexplained. 
It probably is not due to short term effects of ones density (on the time 
scale of variations in the timing tank output), as discussed in Section 4.5. 
On lines that show strong pattern dependence, BPV runs and error runs 
might be expected to give different results. But it is not clear why a line 
can show BPVs in service with very little pattern dependence (evidenced 
both directly by the distribution of errors over the frame, and indirectly 
by normal diurnal variation), and yet transmit the pseudo-random se- 
quence without error. 


4.7 Practical implications of the detailed recording results 


The clearest general implication that can be drawn from the detailed 
error recordings is that error rate measurements on digital facilities 
should be interpreted with caution. For many possible reasons, the error 
rate measured on a T1 line may be quite different from the error rate 
experienced when the line is used for communication. 

An error rate measurement made at night, especially in the early 
morning hours, can be misleading because many lines consistently have 
much lower error rates at that time than during the business day. Fur- 
thermore, since some lines have irregularly varying error rates, a single 
measurement during the business day would not necessarily show the 
typical performance of a line. It would seem that continuous monitoring, 
or at least frequent sampling, would be required to evaluate the error 
rate performance of an individual line. 

It may also be misleading to test a line with a special test signal, or with 
any signal that is different from the communication signal that it nor- 
mally carries. The measurement program showed conclusively that on 
many lines the error rate for a pseudorandom test sequence is quite 
different from the error rate for the D1 channel bank output carried by 
the line in service. Hence the error rates for a digital data signal, or for 
another channel bank output (either a different type, or a different unit 
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of the same type), might be different from the error rates measured either 
with the line in service, or driven by a ‘“quasirandom signal source” (a 
single generator used in telephone central offices, which generates a 
sequence similar to our pseudorandom sequence, except that ones are 
inserted to avoid long runs of zeros). However, the properties of the signal 
that affect the error rate remain unidentified. 

Other implications are less clear. The relationships observed among 
diurnal variation, pattern sensitivity, and error-free internal distribution, 
suggest the existence of at least two clearly distinct causes of errors on 
T1 lines. It has been suggested that these are the same as the two major 
sources of error considered theoretically by Cravis and Crater:* The 
errors characterized by normal diurnal variation and low pattern de- 
pendence would be attributed to impulse noise originating in switching 
machines, which can be strong enough, when it occurs, to cause errors 
regardless of the bit pattern; the errors characterized by high pattern 
dependence would be attributed to crosstalk, which would cause errors 
mainly in those patterns in which intersymbol interference was most 
severe. This appears plausible as a tentative hypothesis. However, no 
attempt was made during the measurement program to identify the 
causes of the errors on individual lines, because of the extensive effort 
that would have been involved. 


V. CONCLUSIONS 


The T1 error measurement program of 1973-74 has resulted in both 
a survey of the population of T1 lines and some detailed observations 
of the error process. 

The survey is in some ways less detailed, less systematic, and less ac- 
curate than previous (unpublished) surveys, but is probably overall the 
best overview we have of the error performance of T1 lines (when ter- 
minated predominantly with D1 channel banks). The large differences 
between offices show that a trend in the distribution of error rates on 
T1 lines, as a function of either time, or growth over time, cannot be in- 
ferred from comparison of surveys taken at different offices at different 
times. In other respects our results are consistent with previous surveys. 
Detailed conclusions were discussed in Section 3.3. 

The detailed error recordings have shown a few remarkable results. 
The fact that many T1 lines can make errors in service without making 
any errors on a pseudorandom signal is both unexpected and unex- 
plained. Another notable result is the observable existence of at least 
two different patterns of errors: one with a seemingly traffic-related 
diurnal variation, two levels of error clustering, and less sensitivity to 
the bit patterns on the line; the other with simpler patterns of variation 
and clustering, but more sensitivity to bit patterns. Another result is the 
fact that errors are always dependent to some extent on the bit patterns 
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on the line, except at very high error rates. Detailed results of the in- 
tensive error measurements were discussed in Section 4.6. 
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APPENDIX 
Statistical Significance of Apparent Association of Error Characteristics 


The statistical significance of the frequencies of particular combina- 
tions in Table VI was verified by a test based on the chi-square test for 
independence in contingency tables. In order to conclude that Table VI 
shows an actual tendency toward certain combinations we had to de- 
termine that such an apparent tendency could not easily have occurred 
by chance. Two problems were met in the testing procedure. First, the 
null hypothesis to be tested must not be the hypothesis that all four di- 
mensions in Table VI are independent, because the apparent tendency 
for certain characteristics to be associated with BPV runs is explainable 
as a property of the measurement technique; the test must allow for this 
and determine whether any further interdependence can be deduced 
from the data. Second, the sample size is so small, and the number of cells 
so large, that the chi-square distribution is not applicable. 

The first problem was not to be solved by simply combining the error 
runs with the BPV runs, because this would leave medium pattern sen- 
sitivity apparently associated with irregular diurnal variation, actually 
because both are associated with BPV runs. Instead, the dependence of 
pattern sensitivity on run type was allowed for by considering the five 
observed combinations of these variables as a single variable. The de- 
pendence of diurnal variation on run type was then removed from the 
table by combining the steady and irregular types into one. The resulting 
3-way contingency table is shown in Table VII. 
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Table VI] — Three-way contingency table derived from Table VI 
and used in statistical testing of the significance of the results * 


Run type 
and pat- Distribution type 
tern early 
sensiti- uni- Estimated 
vity Unimodal modal Bimodal Trimodal Total probability 
error, high 1V 2V 1V 0 4 0.182 
error, low 0 1V 1X 5N 7 0.318 
BPV, high 1X,1V 0 0 0 2 0.091 
BPV, medium 1N,1V1N,3V 1V 0 7 0.318 
BPV, low 0 0 0 2N 2 0.091 
Total 5 7 3 7 22 
Estimated 
probability 0.227 0.318 136 0.318 1.0 


Total of N = re V=11, xX =2 
Estimated probability of N= .409, Ves .500, X = .091 


* N = normal diurnal variation, V = variable (steady or irregular), X = exaggerated. 


The estimated probabilities for each type in Table VII were set equal 
to the relative frequencies. As in an ordinary chi-square test for inde- 
pendence, the probability of each of the 60 possible combinations of types 
was derived (by a computer program) by multiplying the corresponding 
type probabilities. For example, the probability that a line would com- 
bine unimodal distribution type (P = 0.2), normal diurnal variation (P 
= (0.45), and “error high” run type and pattern sensitivity (P = 0.2), is 
0.2 X 0.2 X 0.45 = 0.018. The expected number of occurrences of each 
combination, in 22 lines, is 22 times the corresponding probability. The 
statistic called chi-square, measuring the deviation of the observed 
numbers, o;, from the expected numbers, e;, is derived as 


60 
x7 = > (e;— 0;)2/e; 
= 


For the actual observed numbers, chi-square was 88.04. 

In a large sample this value would be compared with critical values 
based on a chi-square distribution with 50 degrees of freedom (the 
number of combinations of types, minus one, minus the number of in- 
dependent probability values—not counting the ones determined by 
the constraint that probabilities must add up to 1—that were estimated 
from the observed data). If we did this we would conclude that a value 
of chi-square as large as 88.04 would occur by chance, if the three prop- 
erties were independent, with a probability of about 0.01 percent, and 
that the deviation was therefore certainly significant. However, this 
reasoning is not valid, because with a sample as small as 22 the chi-square 
statistic does not have, even approximately, a chi-square distribution. 

A Monte Carlo method was therefore used to estimate the relevant 
distribution. The model that was simulated on a computer considered 
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three independent properties (diurnal variation, waiting time distri- 
bution, and run-type combined with pattern sensitivity), each property 
having 22 perdetermined outcomes distributed according to the type 
totals shown in Table VII. To implement each trial, a random-number 
generator assigned one of the 22 outcomes for each property (by drawing 
without replacement) to each of 22 lines. In 10,000 trials, 62 trials had 
values of the chi-square statistic greater than 88.04. 

We could thus estimate that, under the null hypothesis, the probability 
that the chi-square statistic would exceed 88.04 is 0.0062, but this esti- 
mate has some uncertainty that we cannot easily estimate or allow for. 
The result, therefore, was interpreted by the following reasoning. If the 
‘ null hypothesis is true—that is, if the three properties examined are 
independent—the single field experiment and the 10,000 Monte Carlo 
trials are replications of the same experiment, and the value of chi-square 
for the field experiment was among the 63 highest values in 10,001 trials. 
This would be an event with probability 0.0068, or 0.63 percent. This is 
small enough to consider that the deviation from independence is sta- 
tistically significant. 

A similar procedure was followed with another statistic, max |o0; — e;|. 
This had a value of 4.09 for the field trial, which was equaled in 3 out of 
10,000 computer trials (and never exceeded). Under the null hypothesis 
this would be an event with probability 0.0004, or 0.04 percent, showing 
even more clearly a significant deviation from independence. 

Having concluded that independence of the properties does not ac- 
count satisfactorily for the observations, we are then justified (by Oc- 
cam’s razor) in adopting the simplest hypothesis that does account for 
them, which is that we have observed two different types of error process, 
each having different probabilities for the properties. 


REFERENCES 


1. K. EK. Fultz and D. B. Penick, “The T1 Carrier System,” B.S.T.J., 44, No. 7 (September 
1965), pp. 1405-1451. 

. H. Cravis and T. V. Crater, “Engineering of T1 Carrier System Repeatered Lines,” 
B.S.T.J., 42, No. 2 (March 1963), pp. 431-486. 

. J. S. Mayo, “A Bipolar Repeater for Pulse Code Modulation Signals,” B.S.T.J., 41, 
No. 1 (January 1962), pp. 25-97. 

. P. C. Lopiparo, to be prepared. 

. J.J. Mahoney, Jr., J. J. Mansell, and R. C. Matlack, “Digital Data System: User’s View 
of the Network,” B.S.T.J., 54, No. 5 (May-June, 1975), pp. 833-844. 

. A. K. Reilly, unpublished work. 

. J. E. Kessler, unpublished work. 

. E. W. Geer, Jr., unpublished work. 

. 5S. W. Golomb, Shift Register Sequences, San Francisco: Holden-Day, 1967. 

. H. Hagen, unpublished work. 

. R. F. Ewald, unpublished work. 


SB OOMOAIMD Oe (sw) i) 


oe 


746 THE BELL SYSTEM TECHNICAL JOURNAL, MARCH 1978 


/ 


Copyright © 1978 American Telephone and Telegraph Company 
THE BELL SYSTEM TECHNICAL JOURNAL 
Vol. 57, No. 3, March 1978 
Printed in U.S.A. 


On the Stability of Higher Order Digital Filters 
Which Use Saturation Arithmetic 


By J. E. MAZO 
(Manuscript received July 15, 1977) 


The device of using “saturation arithmetic” to cope with adder over- 
flow in recursive digital filters has, for a number of years now, been 
known to yield stable operation when the filter is of second order and 
is linearly stable. Mitra has recently given examples to show that this 
happy situation does not prevail for higher order filters. Here we in- 
vestigate conditions on the filter coefficients which would guarantee 
stability for higher order filters using saturation arithmetic. We are 
only able to give sufficient conditions for stability. These conditions 
in their simplest form can be written as linear inequalities involving 
the coefficients of the filter. 


I. INTRODUCTION AND SUMMARY 


We shall be concerned with real nth order nonlinear difference 
equations of the form 


yk+n)=f| Saye +n—i |, 204129, x. (1) 
i=l 


The variables y(-) and the coefficients a; are real. The initial conditions 
yQJ),J = 0,1,...,(n — 1) are arbitrary, subject only to the important 
condition |y()| < 1. The function f(-) will be assumed here to have the 
form given in eq. (2): 


f(x) =x, |x| <1 
f(x) = sgn x, |x| >1 (2) 


This function models a method of handling overflow in the practical 
implementation of digital filters and in that literature is referred to as 
“saturation arithmetic.” An important unsolved problem is the as- 
ymptotic stability of this undriven system. Specifically we would like 
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to describe the region in “a-space”’ (i.e. {a;},i = 1,...,n) for which 


lim y, = 0 

ieee 
for any initial conditions. We always assume that the a;’s are already 
restricted so that linear stability holds. That is, if f(-) were replaced by 
the identity function the system would be stable. This is equivalent to 
all roots \; of the characteristic equation 


e(z) =z" —- 3 ajz"-! = 0 (3) 
i=1 


satisfying |\;| < 1.* Since {a;} are real, the complex \; occur in conjugate 
pairs. 

The case n = 2 has special importance to a certain strategy for 
implementing digital filters and has been considered earlier.!-° It was 
shown for this case that eq. (1) is stable for saturation arithmetic 
whenever the system is linearly stable. Recently Mitra* has shown by 
example that a result of this generality does not hold for any n > 2. This 
surprising development has regenerated the author’s interest in the 
problem in its own right. In addition, direct implementation of digital 
filters of the form [eq. (1)] for n > 2 is now of interest, so questions of 
stability must be answered. Saturation arithmetic seems to be an ex- 
perimentally favored procedure at the moment. 

Our main results, Theorems I and II, provide sufficient conditions that 
a given set of coefficients {a;}{ yield a stable filter with saturation 
arithmetic. Both theorems require one to test if a pair of linear ine- 
qualities in a set of variables can be satisfied when the latter lie in a hy- 
percube of (at most) dimension n. A finite algorithm which is sufficient - 
to decide this question is given in Section IV. While we feel that Theorem 
II will give more powerful results (i.e., determine a larger stability region), 
Theorem I allows one to list a set of linear inequalities in the a;, which, 
if any one is satisfied, would guarantee stability. This result is given as 
Corollary I. 


ll. SOME LINEAR AND NONLINEAR THEORY 


If one were concerned with the linear version of eq. (1) [f(-) equal to 
the identity function], the solutions would be 


y(k) = 5 RAE R=0,1,... (4) 
t=1 


* Under this condition, stability of eq. (1) with |y;| < 1 and 
m 
2X Jaz} <1 


is trivial, since the system is then linear. 
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where the ); are the characteristic roots of eq. (3), assumed to be distinct 
in writing the solution eq. (4). The constants k; are determined by initial 
conditions, or equivalently, the values initially stored in the registers. 
Suppose we wish to solve for the k; in terms of y(0),..., y(n — 1). This 
clearly requires the inversion of the matrix 


Lk 20 


MG An 
V(A) = Xi 2 
iol Mee 


called a Vandermonde matrix. A brief discussion of these matrices is 
given in the Appendix, as well as some notation we shall use related to 
them. 

The linear difference equation can also be written in matrix form if 
we introduce the n-vectort 


y(k) 


y(k +1) 


Y(k) = ,- eH 0, Leyes (5) 


y(kR +n — 1) 


or, in component form y;(k) = y(k +1 — 1),i =1,...,n. The “time” 
argument is indicated by the discrete index k. The equation-of-motion 
is then 


Y(k +1) =AY(k) (6) 
0 1 0 — 0 
0 0 1 0 
AS) 0 0 0 (7) 
0 0 eles 0 1 
an Qn-1 acant a9 a1 


Expanding the determinant of the matrix A — XJ (J being the identity 
matrix) by the last row, we obtain (except for asign) the polynomial eq. 
(3), and, not surprisingly, the eigenvalues of A are the roots \; mentioned 
earlier. 

If these eigenvalues are distinct, a well-known theorem of algebra 
guarantees that there exists a nonsingular matrix P such that 

P-'AP=A (8) 


where A is simply the diagonal matrix of eigenvalues of A. 
* Vectors and matrices will be denoted by capital letters. 
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Theorem: If the eigenvalues of A are distinct, then 
V-1AV=HA (9) 
where V is the Vandermonde matrix formed from the roots of the 


characteristic eq. (3). 
The choice P = V is not claimed to be unique. 


Proof: 


n n a ri, Lem =-1 

D aijvjr = DO ay = 

Jo. J=1 ue ‘J ‘ : 

>» Anti-j\h = AR fort =n 
j=l 


The second line in the last member makes explicit use of the fact that 
Ax is a root of the characteristic equation. Thus 


n n n 

A =. = _ 

Db ve; e aij0jn =D vg (Anvir) = Ander 
t=1 J=l1 i=1 

as was to be shown. 


One reason for wishing to diagonalize A in the linear case is the simple 
form that the equation of motion takes. If we multiply eq. (6) by V~! and 
perform the standard trick of inserting J = VV~! after the A in eq. (6), 
we obtain 


Z(k + 1) = AZ(k) (10) 

where 
Z(k) = V-1Y(k) (11) 
Since A is diagonal, the solution for the ith component of Z is simply 
zi(k) = dfz;(0) (12) 


Turning now to nonlinear problems, we wish to summarize some re- 
sults from Liapunov stability theory,’ without proofs, and without 
complete generality.* 

We are concerned with an autonomous (time independent) nonlinear 
difference equation 


Y(k + 1) = F[Y(k)] (13) 


where F is a nonlinear (or linear) vector function of the vector Y(k). 


* A.N. Willson was the first to explicitly apply Liapunov theory to the present problem 
ee 2.3 He has also attacked other stability questions for n = 2 with these methods in 
ef. 8. 
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If 
lim Y(k) = 


k->o 
then the system is called asymptotically stable. 


Theorem (Liapunov): If there exists a (strictly) positive definite qua- 
dratic form w[Y], such that, for any allowed n-vector Y, w[F(Y)] — w[Y] 
< 0 (strict inequality) then the system is asymptotically stable.* 

In other words, if we can find a positive definite quadratic form (of 
the state variables of the system) which is always decreasing as the mo- 
tion proceeds, then the motion must proceed to the origin. The function 
w|-| is called a Liapunov function. 

In terms of f(-) and A the function F() is determined by 


[F(Y)]i= [AY], t=1,...,Q2-D) 
[AY], if |[AY],| <1 
en[AY]n, otherwise 


As asimple example of a Liapunov function consider the linear case 
with nondegenerate eigenvalues, and again set Z = V—!Y. Choose 


[F(Y)]n = (14) 


w= > |z;|? (15) 
i=1 


the z; being regarded as functions of the y()),j =0,...,2 —1. We know 
that when Y — AY we have Z — AZ and so 


n 
w > © [lel 
t=] 
Since we assume | \;| < 1, strict decrease of w is assured. 


lil. A SPECIAL LIAPUNOV FUNCTION 


For the nonlinear problem eq. (1), we shall, for a first pass, choose a 
w|-| whose form is inspired by the one just described. Noting from the 
Appendix 


(-1)'v (a) 
Vit Y]s age XS (-1)ip (Ardy; 
itn, (i) n 
apes LépP(Ny(n —L— 1) (16) 
v(A) b= 


we shall single out the functionals 


+ For our problem any vector Y is allowed aon has components |y;| < 1. We identify 
Y with Y(0) and so y; = yi — 1),7 =1,. 
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xj = = (—1)£pP(\)yn—2 = > (~1)2pP(yn —2-1) (17) 


for special attention. Clearly the equation of motion for the Z variables 
(10) implies that under the substitution Y — AY we have x; — d;x;. We 
choose 


n 
w[Y] = u laa? (18) 
l= 
where the x;, via eq. (17), are regarded as functions of y;,i = 1,...,n, 


the components of Y. Of course we always have |y;| < 1. 
In order to investigate the consequences of the (sufficient) stability 
condition 


w|[F(Y)] — w[Y] <0 (19) 


we note that under Y — AY we have x; — x!) (L stands for linear) 
where 


x) =" (-1)£py(n — 2) (20) 
£=0 
and 
y(n) = ¥ agy(n~ 2) (21) 


Finally the nonlinear (NL) “version” of eq. (20) is 


(NL) = x) if |y(n)| <1 


sgn y(n) + = (-1)2pPy(n- 2) if |y(n)| > 1 


(22) 


where we made use of the definition of F(-), and the fact that po'(- ) 
= 1. Then 


w[lF(Y)] =X [x2 (23) 
i=1 


We have already noted that if |y(n)| < 1 we have a linear iteration 
[F(Y) = AY] and 


w[F(Y)] — w[Y] = w[AY] — w[Y] 
= ¥ |al2lx]2—- 5 [xil2<0 (24) 
i=l i=1 
Hence we will only be concerned with x!" when |y(n)|> 1. In this case, 
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w[F(Y)] is a function of y(1), y(2),..., y(n — 1), while the condition y(n) 
> 1 involves y(0) as well.* 

We shall not use the stability condition in the form of eq. (19), but 
instead note that since 


n 
dX ery lel le (25) 
= 
the condition 
> [xfXP]2- y [x!]2 <0 when y(n) > 1 (26) 
i=l i=1 


is sufficient for stability. At the price of losing some power in the method 
we shall see momentarily that we have gained considerably in analytic 
simplicity. For convenience define 


a= —"E (-I4y(n — 2pPO) (21) 


so that [when y(n) > 1] 

xf?) = y(n) — ¢ 

x{NL) = 1 —¢; (28) 
Then 


: a 3 |x™|? = 2 (y(n) — 1)(y(n) + 1 — ¢; — 7) 


=i) -)[no@+y-226] eo) 
We have used the fact that complex ); occur in conjugate pairs and so 
x PPO) = x pP*(a) 
Thus if the inequalities 
2 = c; ~-n(1+y(n))>0 


y(n) >1 (30) 


have no solution in the n-cube |y(i)| < 1,i = 0,1,...,n — 1 the nonlinear 
equation (1) is stable. More explicitly by using the definition of the c; 
[eq. (27)] Lemma I in the Appendix coupled with (67) to express D?_, ¢; 
in terms of the a;, and using finally eq. (21), we have: 


* By symmetry of the problem, |y(n)| >1is here and henceforth replaced with y(n) >1. 
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Theorem I: If the two inequalities 


n 
> (n-2b)agy(n-L)2n (31) 
e=1 
Dd aey(n — £)>1, (32) 
e=1 
cannot be simultaneously satisfied by some set of y(i) in the cube |y(i)| 
<1,1=0,1,...,n —1, then the system (1) is stable for saturation 
arithmetic. 


We note that if the inequalities are satisfied, no conclusion is 
drawn. 

A systematic algorithm for checking the above inequalities is given 
in the next section. Here we deduce the simple: 


Corollary I: If the coefficients ag satisfy 

> lael|n - 2¢| <n (33) 
then eq. (1) is stable. If the coefficients a; satisfy 

2 lag||k — £| <k (34) 


for at least one k, [n/2] <k <n, eq. (1) is stable.* 

The first inequality follows immediately from the first inequality in 
Theorem I plus the fact that |y(i)| < 1, alli. The remaining inequalities 
follow from the observation that in eq. (31), the coefficients of ag have 
the same sign for @ < [n/2], whereas for € > [n/2] they have opposite 
signs. Thus if eqs. (81-32) have a simultaneous solution, so do 


>~ = lael(n — 22) + S (n—2l)agy(n-#)=n (85) 
1<¢S[n/2] e>[n/2] 
lac] ++ SY aey(n-2)>1 (36) 
é<[n/2] £>[n/2] 


where the above is obtained by setting y(n — @) = sgnag,1< @ < [n/2]. 
If, for k > [n/2], we multiply the second inequality by (2k —n) > 0 and 
add the result to the first we obtain 


DY lal(k-£)+ Y (k-2Ljagyin-L)2k (37) 
é<[n/2] £>[n/2] 


which, if |y(i)| < 1, cannot possibly be satisfied when 


* The notation [x] denotes the integer part of x. 
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Ss lae||k - 2] <k (38) 
=1 


There is one important point to be noted here. Our results have only 
been proven for the case of nondegenerate eigenvalues. For degenerate 
eigenvalues, the Liapunov function that we have chosen in this section 
is not strictly positive definite. We have in fact constructed Liapunov 
functions which are specifically designed to handle the degenerate case. 
Using them, we have proven that the conclusions are still true for de- 
generate eigenvalues. We believe that the form of the results, being 
simple conditions on the ay’s, will allow the reader to readily accept that 
they are true in general. Since our proof of the extension is long and out 
of proportion to its importance, we have chosen to omit it. 


IV. HYPERPLANE ALGORITHM 


Let {b;}#; and {c;}*,, £ and n denote fixed constants. We wish to de- 
termine when it is possible to simultaneously satisfy the inequalities 


k 
yen 
fel 


k 
zicj 2 0 
ise 


fee eel GD Cae ke (39) 


The dimensionality of the problem is immediately reduced if sgn b; = 
sgn c; for some j since we may immediately take z; = sgn bj. It is im- 
portant to note that we assume this to have been done and therefore 
assume b;c; < 0,1 <i SR. 


Lemma: If the simultaneous inequalities eq. (39) are satisfied then there 
exists Z;, andaj,1 <j) < k, such that 


a 
Zid; 


IV 


g 


~ 
ll 
— 


IV 


Zc, = y 


ime 


~ 


|Z; | < 1, lZe| =lall? #j] (40) 
In other words all but possibly one of the coordinates may be given values 


el. 


This is geometrically evident if k = 2. If k > 2 one need only consider 
the z; variables two at a time, always applying the Lemma for k = 2. 
Eventually all but perhaps one of the z; will have value +1. 

Continuing with the description of the algorithm, let E be any k-vector 
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having components eg = +1; there are 2 such E vectors. Choose aj and 
let’s test if indeed it is the j of the Lemma. Let Z¢ = cg, 2 ¥ J. Thenif eq. 
(40) has this solution we have 


lA 


E — D’beeg . oo Dele 


ifb; >0 41 
b, a if b; (41a) 


or 
= / < / 
DS OE pg i 2G (41b) 
Cj j 
If these inequalities are consistent (i.e., the upper bound is at least as 
big as the lower bound) and if they can be satisfied by some Z;, |Z;| < 
1 we are done—the simultaneous inequalities are satisfied. If not, try 
another E vector, or another j. For a given j there are 2"~! E vectors to 
try. Hence after at most k-2*-! such attempts we have exhausted all 
things that need to be checked, and checking the inequalities is, it has 
turned out, a finite procedure. 
This procedure is not only applicable to Theorem I, but also to The- 
orem II occurring in Section V. 


V. ANOTHER LIAPUNOV FUNCTION 


We have already noted that the entire sequence {y;}5 is determined 
by the first n elements. For our second choice of the Liapunov function 
we chose the expression for the energy in the remainder of the sequence 
for the linear problem: 


w[Y] = > y2 (linear case) (42) 


The right member is regarded as a positive definite quadratic form in 
Y(0). In the linear case we also have, numerically, 


w[AY] = > y? (43) 
nt+1 


which is smaller than w[Y] by y2(n). Thus this w[-] doesn’t necessarily 
decrease after every iteration and thus it is not strictly a Liapunov 
function. However after at most n iterations it must decrease (unless all 
y; = 0) and the effect will be the same. We shall have stability if we can 
show whenever y(n) > 1, that 


w[F(Y)] —w[Y] <0 (44) 
or, equivalently 


w[F(Y)] — w[AY] < yn (45) 
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We begin by writing down the generating function for the sequence 


y(n), y(n + 1),..., when linear theory holds. By definition 
H(z)= > y(n +h)z* (46) 
k=0 


Using standard linear techniques we calculate from eq. (1) and its initial 
conditions 


n 


3 y(n —j) © ajegz? 


H(z) == = (47) 
n 
_— SS a,zs 
s=0 
where we have arbitrarily defined ag = —1. We note the characteristic 


polynomial 
n . 
eZ) == aig? 
i=0 
has the same modulus as the denominator of H(z) when |z| = 1 (since 
the a; are real). 


We next note that 
3 1 1 . 
Y yk=5- f" |H@ = e!*)/2a0 (48) 
k=n 2Qr = 


To express this as a quadratic form introduce, for 0 < |s —t| < (n — 1) 
the integrals (z = e!®) 


pet rath gga Pee ao (49) 
ae a nen 2 Qa J—r} 2 2 


Asz 8 Die 4 
s=0 s=0 


Also, as a contour integral around the unit circle, we have 














1 gnts—t—-l 
Ls. = ee 50 
Oni Dae a0) 


The first form of the integrals shows they are real and J,; = [;,. Also in- 
troduce the real symmetric matrix 


n-jn-k 


Ayr, = Ds DS jue pael st, J, k = 1, 22-57 (51) 
s=0 t=0 


Note for j = k, Hj; > O since the right side of eq. (51) is then a Teoplitz 
form with positive spectral function. Then with these notations simply 
using eq. (47) to expand the integral in eq. (48) yields 


w[Y]= ¥ y= 2 yin —jyln — RH (52) 


Jk=1 
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or 


w[AY]= YS ye= VL y(nt+1—j)y(n+1—k)Hj, (53) 
k=n+1 jk=l 
When y,, > 1, w[F(Y)] has the same form as eq. (53) except that y,, is 
replaced by unity. Thus in evaluating w[F(Y)] — w[AY] [by using eq. 
(53)| most of the quadratic terms will cancel. Doing this and a few minor 
manipulations, the criterion for stability will read that we must have 


Mot heees HaeGsisiee = «6H 
j=2 y(n) -1 
whenever 
y(n) = ¥ ay(n—j)>1 (55) 
i ee 


If we define H,,,41 = 0, we may write this as: 


Theorem IT: If there are no simultaneous solutions to the inequalities 


n : 2 
2 lg — 2h aaly(n 1) = Aa + ave (56) 
i=1 Vin) ak 
y(n) = = ayy(n —j)>1 (57) 
I= 
ly@)| <1 i=0,1,...,n-1 (58) 


then the difference eq. (1) is stable with saturation arithmetic. 


To convert this into a hyperplane problem (discussed in Section IV) 
the nonlinear term y?(n)/(y(n) — 1) may be dropped or replaced by the 
value four (since 





52 
2A4 
praca 8 

when x 2 1). If we drop this term, the resulting stability has the physical 
interpretation that at any time the energy in the remaining tail of the 
nonlinear response is less than or equal to the corresponding energy for 
the linear problem, regardless of the previous state. That is, if measured 
by energy, the nonlinear undriven response dies off at least as fast as the 
linear response for any initial conditions. 

We also note that whenever the Liapunov function w = YtHY [eq. 
(52)] drops to the value unity the system behaves as a linear one from 
there on (no future yz will exceed unity). Several bounds for this quantity 
may be given. Since 
2 

= J] 


n 
t=1 





a2" 
s=0 





j1 —zaA;|2= I] a- |A;])? 
t=1 


758 THE BELL SYSTEM TECHNICAL JOURNAL, MARCH 1978 


we have 


1 © 
ee. d0|H(z)|? < max |H(z)|? 
20 7 z=eld 


n n-j 9 

XL yin J) X ajssz° 

= =0 

= max J : 
z=el8 n 

= os a.2° 
0 








n Of ae 2 n 2}n-1 } 2 
¥ lym aed] |S tal |S wa 
i jJ=1 s=0 2 j=1 i=0 


(59) 


(l= [A; |)? 


n n 


(P=Aq |) 


Vi. EXAMPLES 


The few simple examples of this section will shed light on the two 
methods we have given. When n = 2 the two inequalities of Corollary 
Tare simply |a2| < 1, |a1| < 2. Since linear stability implies |a,| = |\z 
+ do] < 2, |a2| = |A1A2| < 1, we see that for n = 2 linear stability implies 
stability with saturation arithmetic. We have already mentioned this 
is not true for n > 2. Mitra has constructed a counter example using the 
degenerate case \j = Ag =...= A, = y. Forn = 3 he finds oscillations 
if |y| = 0.858, although if |y| is smaller than this stability is not implied. 
If we consider the second inequality of Corollary I for n = 3,k = 2,a,= 
BY, Ag = —3y2, a3 = y? we have stability if 


3ly| + ||? <2 


or |y| < 0.596. No better result is obtained for this case by a complete 
use of Theorem I. 

On the other hand if we apply the criterion of Theorem II (neglecting 
the nonlinear term) with the algorithm of Section IV, we find stability 
if |y| < 0.71. Insignificant improvement would be obtained here if we 
had also included the nonlinear term. The application of Theorem II to 
the present case was sufficiently simple so that the calculation could be 
done by hand. The integrals were done exactly to give 


2 
Ay = =~ [9 — 9A2 + 10A4 — BAS + VB 
ogee 
—33 
Hy. => (8+ 
earEmre Ca 
HD 
Ais = = (1+ 60 
We qaae (60) 
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Clearly in general the integrals would have to be done numerically. The 
fact that the nonlinear term did not contribute significantly is due to the 
combination of facts that at the critical value for \, y(n = 3) is not ex- 
tremely close to one and also H} is large, due to its denominator. 

It should be pointed out that 


can be expressed using the solution eq. (4) for the linear problem as 


Sy? = Y+(V-)IPV-1Y (61) 
where 
- nyo 
ry =e (62) 
Ly yy 


However this explicit form is only true for the nondegenerate case and, 
although all limits exist as \; — dg, etc., the expression would probably 
not be suitable for numerical computation when eigenvalues are close 
to being degenerate. 
Another example of limiting misbehavior being only apparent is that 
in a similar manner one could compute that 
n 1 
H(z)= > 


iz1 1-2); 





(V-TY); (63) 


Individual terms in this expression are badly behaved as, for example, 
if \y © Ao, but the alternate form eq. (47) shows everything is well be- 
haved in the limit. 

If we return to Corollary I applied to n = 3 when (Aq, dg, A3) = pli, —1, 
1),0 < p <1, we see that the inequality 


|a,| a |a3| <2 


is sufficient to guarantee that for filter poles in these relative position 
saturation arithmetic will give a stable filter for any p, all the way out 
to the boundary of linear stability. 

Based on these examples we feel that Theorem II is the more powerful 
method although the simpler Corollary I can yield considerable infor- 
mation for particular cases. 

Finally, we note that an important investigation on the present 
problem has just been completed by Mitra,? resulting in different sta- 
bility criteria from those presented here. Mitra’s results will give a 
polynomial type criterion for absence of periodic oscillations. These 
results in themselves do not prove stability in that Ref. 9 does not pre- 
clude unending periodic outputs with no input. However, we take the 
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liberty of mentioning that Mitra has extended the proof to include sta- 
bility and thus the results of Ref. 9 may be taken as proving the same 
type of stability as discussed here. Another comparison with Ref. 9 in- 
volves the size of the stability region in “tap-space” which the two 
methods give. Neither Mitra’s criterion nor ours can claim to describe 
the largest stability region. Also it does not even seem possible at this 
stage to give theoretical arguments to decide if one of the methods is 
always superior in this respect. However, several examples indicate that 
the region determined by Mitra’s criterion is larger. Assuming this to 
be the case in general, an effective practical procedure would be to first 
test for stability using our simple Corollary I, and if this fails, apply 
Mitra’s polynomial test. 


APPENDIX 
The Vandermonde Matrix and Symmetric Polynomials 


The a denote an ordered set of n complex members aj, i.€., a1, @2,..., 
Qn. By the Vandermonde matrix V(a), we shall mean the matrix 


1 1 1 ae 1 
ay a9 a3 Bagh « A 
_ 2 2 2 2 
Via) =| at as a3 eke ee (64) 
a? —1 age ats -1 ane! 


This can be written [V(a)]ij = a} i,j =1,...,n. We let v(a) = det 
V(qa), and it is known? that 


v(a) = II(aj — aj) (65) 
where the product extends over all i, j satisfying 
Ls=7<j sn 


If a; ¥ a; fort + 7 then the inverse of V(a) exists, and is known. Before 
giving its structure, we wish to list some facts concerning some special 
symmetric polynomials.® 


Definition: The @th elementary symmetric function of n-variables (2 
= 1,2,...,n) is the sum of all formally distinct products of the variables 
taken @ at a time. We also define po = 1. 


For example if n = 3 we have 
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Pola) =1 
Pila) =a, tag t+ az 
Pola) = ayag + aya3z + agag 


Pala) = ayagaz (66) 


A well known theorem of algebra states that any symmetric polynomial 
in all the a’s can be uniquely written as a polynomial in the quantities 
pi(a),i=0,...,n. 

Note that in the characteristic polynomial eq. (3) we have 


a; = (—1)'*1p;(A) (67) 


where the );,i = 1,...,n, are the roots of eq. (3). Thus the theorem just 
stated says that any symmetric polynomial in the roots of a polynomial 
can be expressed as a polynomial in the coefficients of the equation 
(rather than a complicated function as would be required to express an 
individual root). 

We shall use the notation p¥)(a), 2=0,...,n —1, to denote the @th 
elementary symmetric function formed from the (n — 1) ordered vari- 
ables a1,..., Qj—-1, Qj+1,.--, Qn. Likewise v“)(a) denotes the determi- 
nant of the corresponding (n — 1) X (n — 1) Vandermonde matrix. 


Theorem: 
(i) 
[V- 1(q)]i; = ee os (i) . jla ) (68) 
il.jJ=l,...,n 
Proof: We have 
n n i (i) 
ag * E [Vea laelV Hod] = Saf DEPP) (egy 
i=1 j= v(a) 


From eq. (65) and the definition of v (a) we see that 


v©(a) 1 
ee i 70 
va) (Ay Ty (ar — a a 
exi 
and hence eq. (69) is 
n (i) —] n-J 
eS: af py? (a)(—1)"~J (71) 
i=1 II (a; — ae) 
LAL 


Form 
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n . 
q(x) = > qrjx! 
j=l 


and note that* 


E PP (ay(—1yrii = Fe — ae) (72) 
J= L 
Thus 
7 II (x - ae) 
X qyxi-t = DO ah) =} —___ (73) 
j=l t=1 Hy, (a; — ae) 


has value a*—! when x = am,m =1,...,n. From this it follows that q(x) 
= x-1, so that gz; = 5%;, which we were to prove. 


We leave it to the reader to convince himself of the following: 


Lemma lI: 
2 p? (a) = jpn-j(a) j>0 (74) 
=1 
Lemma IT: 
pj) = pi?) = Nipf2s) J =1,2,...,n—1 (75) 
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Sequentially Companded Modulation for Low- 
Clock-Rate Speech Codec Applications 


By S. V. AHAMED 
(Manuscript received October 7, 1977) 


The expansion of the step size of a speech codec may be arranged 
to change as the number of identical consecutive bits starts to increase. 
This technique causes the codec to respond partially to a fewer number 
of identical consecutive bits and more dramatically to larger numbers. 
In contrast to a typical exponential expansion of the step size, the 
proposed technique, in addition, expands the exponent. For speech, 
two distinct advantages have been observed: (i) the improvement of 
higher frequency audio frequency response at the same clock rate and 
(it) the reduction of idle channel noise. In practice we have found that 
three- and four-bit companding will suffice for a typical 24 kHz, ADM 
codec. The proposed companding appears to be an acceptable choice 
between two-, three-, and four-bit companding which leads to better 
frequency response but worse noise, and four-bit companding which 
leads to both worse frequency and noise responses. 


Il. INTRODUCTION 


The many distinct advantages of companding to encompass the dy- 
namic range of speech signals are well documented. In most cases, a 
simple law is used repeatedly to arrive at the companded step size. For 
instance, in the 37.7 kHz, ADM SLC-40 codec,! a nonlinear function of 
the frequency of occurrence of four identical consecutive bits forces an 
expansion of the step size. In the two-bit companding described in Ref. 
2, the expansion of step size follows a geometric progression. Three bit 
companding described in Ref. 3, again depends on a base-two geometric 
series for increasing the step size, when two consecutive bits are the same, 
and again on the same series with a base-half for decreasing the step size 
when the bits are of opposite polarity. Most of these systems perform 
adequately at higher (typically above 32 kHz) clock rates. However, when 
the clock rate is decreased, the simple fixed rules of companding either 
offer an unacceptable quantization noise at lower step sizes, or make the 
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Fig. 1—Block diagram of encoder (see Ref. 4, Fig. 7). 


message so choppy at larger step sizes that it becomes unintelligible. A 
happy compromise between the two is simply impossible. For this reason, 
we have investigated the problem by having a series of weightings at- 
tached to identical consecutive bit patterns of two, three, and four bits. 
When the bit pattern reverses, the decay of the step size is effected by 
an RC circuit with a time constant of about 9 msec. 

Sequential companding proposes to utilize the binary sequence of data 
for companding the step size in a multiplicity of modes at different in- 
stants of time in a gradual way, whereas conventional encoding translates 
the companding information more drastically after a critical threshold 
has been reached. The multiple use of bit stream to convey companding 
information enhances the effective usage of bits at the same bit rate, or 
achieves the same quality of speech at a lower bit rate. 


ll. PERIPHERAL CIRCUIT FOR TESTING SEQUENTIAL COMPANDING 


An existing ADM codec! has been used to test the principle of se- 
quential companding. The encoder has a double integration feedback 
loop with the main pole at 235 Hz and the secondary pole at 2870 Hz. 
Figure 1 is a block diagram of the encoder. Four bit companding is ef- 
fected by a logic circuit which forces an incremental charge on a 1 wF 
capacitor through a 3 kQ resistance. The duration for which the charging 
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takes place equals the time during which the logic circuitry senses four 
consecutive ones or zeros at the output of the comparator, which in turn 
senses the difference between the incoming speech signal and the voltage 
across the feedback loop. When the four bits sensed are not all identical, 
the 1 uF capacitor is allowed to discharge to a 9 kQ resistance. This 
combination yields two time constants; one for charging (attack) of about 
3 msec, and one for a discharging (decay) of about 9 msec. The voltage 
across the | uF capacitor dictates the step size. A nonlinear circuit gen- 
erates a step current whose magnitude depends upon the voltage across 


CAPACITOR VOLTAGE (ACTUAL VOLTAGE <-) 
=) -6 5 4 = = = 0 


STEP CURRENT 





1 2 3 4 5 6 7 8 
CAPACITOR VOLTAGE (-» VOLTAGE DIFFERENCE) 


Fig. 2—Compander capacitor voltage and step current relationship. 


the 1 uF capacitor. Figure 2 depicts the 1 uF capacitor voltage and the 
compander current. When there is no companding at all, the voltage 
across the capacitor becomes quite low (about 1.6 V) and the minimum 
step size of the 10 vA is reached. As the voltage reaches about 6.5 volts, 
the current step size reaches about 2 mA yielding about 46 dB range for 
the step size. The compander current is used to accumulate or deplete 
a charge on a final 0.33 uF integrator capacitor and it is the voltage across 
this capacitor which yields the original voice frequency signal after a low 
pass filter with a 3 dB loss at about 2000 Hz. The final capacitor has 
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about 1.2 kQ in its discharge path. When the current step size is about 
10 vA, the voltage swings between +0.58 mV across the integrator ca- 
pacitor at a clock frequency of 24 kHz. 


lll. ESSENTIAL DIFFERENCES BETWEEN CONVENTIONALLY 
COMPANDED AND SEQUENTIALLY COMPANDED CODECS 


3.1 The idle channel noise 


The character of the idle channel noise is totally different in the se- 
quentially companded codecs. Whereas in the conventionally com- 
panded encoder, the bit pattern generated by the encoder during a silent 
period is nondeterministic and depends on the comparator character- 
istics, the sequentially companded encoder generates a sort of restless 
limit cycle in which the climax occurs when the three consecutive ones 
are produced, and the step size increases very slightly followed by pairs 
of zeros and pairs of ones for a few cycles. Meanwhile, the step size starts 
to decay due to lack of any companding, the paired zero-ones gradually 
vanish to a single zero and one combination and then the whole cycle 
repeats. This constitutes a semistable limit cycle and has been photo- 
graphed in Figure 3a. The top trace shows the audio output from the 
decoder. The central trace is the integrator voltage in the decoder. The 
last trace is the clock at 24 kHz which also triggers the oscilloscopic 
sweep. In contrast a similar oscillogram (Fig. 3b) for a conventionally 
companded codec shows a total absence of any pattern during the si- 
lence. . 

These two pictures also forecast the difference in character of the two 
idle channel noises. The sequentially companded decoder presents a 
component of frequency at about 190-200 Hz which is considerably 
quieter than random noise generated by the conventionally companded 
ADM. But the higher frequency channel noises are less than those in the 
conventionally companded ADM coded. Psychologically the effect of such 
a low-frequency steady low-level signal appears to be more tolerable than 
the randomly varying noise produced by the latter. 

Sequentially companded codecs also have one additional feature to 
enhance the signal to idle channel noise ratio. The frequency of com- 
panding in the three and four bit compander is higher than that in the 
four bit compander alone. Hence, the signal strength at the input to 
encoder can be higher at the same input frequency. The study of the four 
bit conventional codec suggests a 60 mV occasional peak at the input 
level to the codec. This value is appropriate and is consistent with an 
average voltage across to the integrator capacitor with the compander 
switch being functional (Ref. 4) for about 10 percent of the time. How- 
ever, for a sequentially companded codec, the compander switch would 
be functional almost twice as frequently. However, the charging rate of 


768 THE BELL SYSTEM TECHNICAL JOURNAL, MARCH 1978 


sae jentae 





Fig. 3—Idle channel noise characteristics of (a) sequentially companded ADM codec 
and (b) conventionally companded ADM codecs. 


the three bit compander is less than the charging rate of the four bit 
compander. Further, the switch for the four bit works in unison with the 
three bit switch. Hence, the average signal level for the sequentially 
companded codec tends to be on the order of 160-220 mV. At this level 
the sequentially companded codec at 24 kHz “sees” a frequency of 4 kHz 
the same way as a conventionally companded codec at 37.7 kHz would 
see a frequency of 4.7 kHz. But since the resistances in the charging paths 
are different, the responses would also be slightly different. Further, the 
response to the lower frequency from the sequentially companded codec 
should become nominally better* since both three- and four-bit com- 
panding can take place simultaneously. 

The signal to noise implication of this difference of behavior between 
the two codecs is that whereas the noise remains about the same for the 
sequentially companded codec, the signal level increases about two to 
three times bringing down the signal to idle channel noise ratio consid- 
erably. This result has to be observed consistently during normal func- 
tioning of the codec. 








* Experimentally we have seen very little difference at low audio frequencies which tend 
to pass through the telephone network. 
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Fig. 4—Unit function response of (a) conventionally companded codec and (b) se- 
quentially companded codec for a 50 mV step function. 


3.2 Unit function response 


This response indicates the rapidity with which the codec can respond 
to changes in the input level. It also brings about the high frequency 
response of the codec. During the normal operation the capacity of the 
sequentially companded codec to respond rapidly is reflected by lower 
slope overload noise. When a 50 mV step is imposed on the encoder, the 
decoder responses for the conventionally and sequentially companded 
codecs are presented in Figures 4a and b and output voltages are also 
tabulated in Table 1A. Similar response to a 360 mV surge is shown in 
Figs. 5a and b and the output voltages are presented in Table IB. 


3.3 Higher audio frequency response 


An audio frequency of 3149 Hz (nonsynchronous with the 24 kHz 
clock) is chosen for the comparison of performance of the two codecs. 
Figures 6a and b represent two spectrograms generated at the output. 
In the response from the conventionally companded codec (Fig. 6a), the 
peak at the input frequency is surrounded by a large number of cluttered 
peaks with rapidly changing (indicated by the density of broken patterns 
and smears in the figure) tones. Each smear is an audible change in tone 
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Table |A — Differences in responses to 50 mV step function at 


encoder 
Elapsed Conventionally Sequentially 

time, companded companded 

psec codec, mV codec, mV 
200 2.5 3.0 
300 10.0 15.0 
400 20.0 29.0 
500 32.0 47.0 
600 43.5 55.5 
800 44.0 48.0 

1000 44.0 50.0 


Table IB — Differences in responses to 360 mV step function at 


encoder 
160 19 35 
320 50 80 
480 110 180 
640 205 300 
800 320 370 
960 320 370 





Fig. 5—Unit function responses of (a) conventionally companded and (b) sequentially 
companded codecs for a 360 mV input surge. 
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FREQUENCY IN kHz 


Fig. 6—Typical responses at about 3000 Hz from (a) conventionally companded and 
(b) sequentially companded ADM codes. 


which cannot be missed by a listener. In contrast the response from the 
sequentially companded codec is cleaner and better formed with a fewer 
number of breaks and smears. Listening to the output tones from the 
two codecs also confirms this result. 


IV. COMPUTED SIGNAL-TO-NOISE RATIOS 
4.1 Validation of the computer model 


Computerized models of the conventionally and sequentially com- 
panded codecs have been developed to compute the S/N ratios. The 
model of the codec programmed for a Nova 800 minicomputer is a gen- 
eral purpose version of a typical ADM codec whose companding can be 
changed by input variables. The same model serves to compute the bi- 
nary sequence, the output wave shapes, the signal to noise ratios, etc., 
by altering the data to the minicomputer in a conversational mode of 
communication between the operator and the machine. For the encoder 
model an ideal comparator is used. Hence, the additional noise generated 
by the imperfection of the comparator is absent from the computed re- 
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Table Il — Computed and measured S/N ratios 


Measured* 
(Ref. 4), Computed,* 
Frequency dB dB 
300 38.5 40.0 
800 32.5 32.5 
1600 22.5 22.0 
* The clock rate is 37.7 kHz. 
Table Ill — 24 kHz steady-state S/N ratios 
Audio Conventional Sequential 
frequency, compandin compandin 
Hz* Bits STN Bits STN 
2900 3 7.9 
4t 5.7 3,4 8.35 
2490 3 3.7 
4t 2.1 3,4 3.9 
1660 3 16.0 
4t 13.0 3,4 15.3 
830 3 25.1 
4+ 23.2 3,4 29.4 
415 3 31.6 
4t 31.0 3, 4 31.8 


* These numbers are chosen to be irrational fractions of to clock rate at 24 kHz to avoid 
a limit cycle condition. 
+ Presently used charging resistance = 3.01 kQ. 


sults. This leads to the assertion that the computed S/N ratios would 
be the upper bound for the measured S/N ratio. These values (published 
in Ref. 4) have been used to validate the model at different sine wave 
inputs and have been presented in Table II. 

From the computational models it also becomes evident that the S/N 
ratio is by no means a constant but a time varying quantity. Whereas the 
S/N ratio is measured as a time-average over a period (typically between 
0.5 to 3 seconds), the computed S/N ratios are averaged over much 
shorter intervals and, hence, one would expect some difference between 
the computed and the measured values. Nonetheless, since the time 
constant is the same for all computations, the cross comparison of the 
computed values would still be a valid relative measure of their perfor- 
mances. To reduce the effect of transient variations of the S/N ratios, 
the computed value is the average of 60 such ratios at 60 consecutive 
clock cycles, each ratio being calculated as the moving average of twenty 
adjoining ratios around each clock cycle. Further, a series of such com- 
putations are made over numerous input frequency cycles and an average 
number is derived. 


4.2 Effect of sequential companding on steady-state sinusoidal inputs 


Table ITI lists the values of computed signal-to-noise ratios for con- 
ventionally companded and sequentially companded codecs at 24 kHz. 
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Fig. 7—S/N ratios for rapidly varying frequencies. 


The input parameters have been held substantially the same for all the 
cases presented. However, the values of the charging resistances* for 
three-bit companding has been computationally optimized as 5.2 kQ, 
whereas its corresponding value for four-bit companding has been re- 
tained as 3.01 kQ as it exists in current applications. In case of the se- 
quential companding their valuest have been optimized as 5.2 kQ for 
three bits and 4.36 kQ for four bits, even though any other resistance 
values in their proximities will perform adequately. 


4.3 Effects of sequential companding on frequency modulated sine wave 
inputs 


Steady tones are rarely encountered in telephone conversations. 
Rapidly varying frequencies are, however, typical and for this reason we 
have studied the responses of conventional and sequential coding at 24 
kHz clock rate when the input signal has a frequency which changes 
gradually from 250 Hz to 2900 Hz and back to 250 Hz within 30-50 
milliseconds. Such changes are well perceived by the listener and the 
average signal to noise ratios indicate the relative faithfulness with which 


* This charging resistance controls the voltage on the step size capacitor, which in turn 
controls the current step for charging or depleting the final integrator. 

* Experimental determination (Section III) for best subjective listening seems to indicate 
that 5.2 kQ and 3.01 kQ are the desirable values for three- and four-bit companding. 
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Fig. 8—Square-wave response to 1660 Hz signal. 


the codecs can follow the change in frequencies and thus retain the 
original message characteristics. 

The response to such a frequency characteristic is presented in Fig. 
7. The average S/N ratio during the entire cycle is 10.18 dB for the se- 
quential and 8.29 dB for the conventional encoding. The averages of 
positive S/N ratios are 17.95 dB for sequential and 14.91 dB for con- 
ventional encoding. 

To signify the uneven behavior of the conventional encoding further, 
square wave at 1660 Hz was presented at the encoder. The results are 
shown in Fig. 8. The effectiveness of the sequential companding in fol- 
lowing rapidly changing input is also illustrated in Fig. 9. Computed S/N 
ratios are plotted when a tone burst signal at 1660 Hz is presented to the 
two types of the encoders. To meet the rapidity of response of the three- 
and four-bit companding, the charging resistance (see Sec. 4.2) of the 
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Fig. 9—1660 Hz tone burst response. 
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three-bit conventional companding is adjusted from 5.2 k0Q to 4.5 kQ. 
However, the steady-state S/N ratio attained by the three- and four-bit 
sequential companding is about 18.7 dB as against 16.3 dB for the con- 
ventional companding. We have not been successful in achieving a rapid 
response and also a consistently high steady state S/N ratio from con- 
ventional companding to match the performance of the sequential 
companding. To quantify this assertion we have roughly the same ra- 
pidity of response from both codings at 830 Hz. However, the steady- 
state S/N ratio of sequential coding is 4.3 (29.4 vs. 25.1) dB higher than 
conventional coding. Complementarily, when the steady-state S/N ratios 
are approximately the same at 250 Hz, the sequential companding S/N 
ratio is roughly 9.5 (37.0 vs. 27.5) dB better than conventional com- 
panding S/N ratio 2.3 msec after the tone burst. 


V. DISCUSSION OF THE DIFFERENCES 


5.1 Experimental results 


Consider an ideal case where a series of ones is presented at the input 
data of the ADM decoder. The integrator voltage responds to an in- 
creasing step current. If the discharge of this integrator capacitor is ig- 
nored for the present, then the voltage across the integrator is directly 
proportional to the increasing current step. Qualitatively this voltage 
may be represented by curves A-E of Fig. 10. At 48 kHz the change in 
step size after the eighth “one” is h’i’, whereas the step size after the 
fourth “one’’at 24 kHz (corresponding to the same lapsed interval of 
time) is only hj. If the codec had three- and four-bit companding, the 
change in integrator voltage would have been h”j” which is greater than 
hj due to two reasons: (i) the additional companding step at d and (ii) 
the simultaneous action of both the three- and four-bit companding at 
f”. Asimple three-bit companding would have had a slower response as 
depicted by the curve D. 

Again consider the influence of the resistances in charging paths of 
the step size capacitor if there are two resistances R; and R’ in the 
charging paths energized by the three-bit and four-bit companding, then 
the slope of the curve B can be adjusted to any desired value. Different 
values of these resistances yield different ranges of these curves with one 
essential, vital difference. When the resistance is too low, the step size 
becomes too coarse leading to crackle within the word every time the 
polarity of a bit changes after a series of ones or zeros, and it is this di- 
rection in which a real compromise must be sought while adjusting the 
value of the resistances to minimize the slope overload noise. 

In essence, the sequentially companded ADM codec behavior differs 
from that of a conventionally companded codec to the extent that an 
additional factor of nonlinearity is imposed in its response. The con- 
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Fig. 10—Idealized responses of ADM codecs. 


ventionally companded ADM codec responds by the inherent nonlinearity 
between the step size capacitor voltage and the current step, whereas 
the sequentially companded codec expands the step size capacitor 
voltage itself with a nonlinear character, and this nonlinearity rides along 
the nonlinear relation of Fig. 2 at all sizes of the current steps making 
the codec more responsive and more sensitive especially at lower clock 
rates. The validity of this assertion is demonstrated in experimental 
results in Figs. 4, 5, and 6. 


5.2 Computational results 


The steady-state sinusoidal response of a sequentially companded 
codec can be matched by a conventionally companded codec operating 
at a lower number of bits of the sequentially companded codec. However, 
the rapidity of such a response cannot be achieved by conventional 
companding which can also yield a consistently higher steady state S/N 
ratio. The computational verification of this assertion is indicated in Fig. 
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9 due to consistantly lower S/N ratio from the conventionally com- 
panded codec during the cycle of frequency variation. This characteristic 
is reflected by crisper sounding words from sequentially companded 
codecs. 

Sequentially companded codecs seem to reach and hold a steady state 
more quickly and more effectively as shown in Fig. 8. The large swing 
of the S/N ratio when reaching the steady state by the conventionally 
companded codec indicates its inadequacy to yield steady tones essential 
for MFKP system and TOUCH-TONE® signals. 


VI. CONCLUSIONS 


Both experimental and computational results confirm that sequential 
companding reduces the response time from the codec even though 
conventionally companded codecs can compete well in the steady state 
response for sine wave excitations. Further, the tones generated from 
sequentially companded codecs tend to have fewer breaks and thus are 
steadier and yield higher S/N ratios at higher audio frequencies. 

Experimental results indicate that the idle channel noise from se- 
quentially companded codecs is considerably lower than the idle channel 
noise from conventionally companded codecs during actual message 
transmission. The intelligibility of the message is also better due to 
crisply formed words and lower background noise. 

Computational results indicate that when the frequency centers 
around 800 Hz and when a small frequency modulation is embedded, 
then the S/N ratio from the sequentially companded codecs is about 2-3 
dB better than an optimally designed conventional codec. 
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Gradient Encoding for Low-Bit-Rate Stored 
Speech Applications 


By S. V. AHAMED 
(Manuscript received October 7, 1977) 


In stored speech applications, the waveform of the message is com- 
pletely specified and can be effectively used to reduce the bit rate at 
which the message may be synthesized. In gradient encoding, we 
propose to maich the gradient of the output wave of the differential 
decoder with the required gradient between discrete clock cycles. When 
the required gradient is very steep the bit pattern selected maximizes 
the rate of change of the decoder voltage, otherwise appropriate bits 
of opposite polarity are inserted to match the amplitude of the decoder 
voltage with the required voltage at the discrete clock-cycles. The 
performances of gradient encoding and conventional encoding are 
presented as corresponding signal-to-noise ratios under different in- 
puts and circuit conditions. Further, our preliminary results indicate 
that gradient encoding can lead to comparable quality of speech at 


about half the bit rate of the conventional encoding between 32 to 24 
kbaud. 


|. INTRODUCTION 


For stored speech application, one of the ways of generating efficient 
binary data is tree encoding, which examines and verifies the sequences 
of a prespecified number of bits by varying the data in every possible 
combination and selecting the one that yields the best signal-to-noise 
ratio. This way of exhaustive searching for the best bit pattern demands 
a large number of computations, and the number of computations ex- 
pands geometrically as the number of bits in the tree (i.e., the number 
of sequential bits chosen to explore the range of variation of the decoder 
voltage) increases. This leads to a further uncertainty about whether the 
number of bits chosen is satisfactory or not for any given section of 
speech. 
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To circumvent this problem we have chosen to seek an alternative 
algorithm and proceed on a variable length of speech waveform deter- 
mined by the gradient around the section under investigation. This is 
accomplished by realizing that all speech wave shapes consist of peaks 
and valleys, and the duration between these successive extrema should 
guide the duration of the computation, and that the gradient between 
them should guide the fragmentation of the compute-and-match pro- 
cedure attempted during the relaxation* of bits for the intervals between 
the peaks and valleys or valleys and peaks. 


ll. THE BASIC APPROACH 


When the locations of the peaks and valleys contained in any segment 
of speech have been determined, the synthesis of the optimal bit se- 
quences may be routinely and systematically determined as follows: 

(i) Determine the change in amplitude required from the differential 
decoder and the interval for the change. 

(ii) Determine the best the decoder can accomplish by forcing a se- 
quence of zeros (for peak to valley fit) or ones (for a valley to peak 
fit). 

(iii) If the decoder can exceed the required change, halve the interval 
for the computation and evaluate the decoder performance by stuffing 
zeros or ones during half the interval. 

(iv) Proceed to repeat (ii) and (iii) until one of the following occurs: 
(a) The decoder performance comes to within a very tight tolerance level 
of what the original speech wave called for. (b) The interval for com- 
putation has collapsed to one clock cycle of the decoder and if so choose 
a (0) or (1) that minimizes the error at the end of that particular clock 
cycle. 

(v) When iv(a) or iv(b) are completed, update the new peak as the 
last point processed under iv(a) or iv(b) if the search pattern is pro- 
gressing from a peak to a valley and retain the same valley or update the 
new valley as the last point processed under tu(a) or iv)b) if the search 
pattern is progressing from valley to peak and retain the same peak. 

(vi) When the binary bits during the interval have been synthesized, 
proceed to the next section of the speech wave shape—i.e., to the next 
peak-valley or valley-peak pair. 


ill, DIFFERENCE BETWEEN CONVENTIONAL ENCODING AND 
GRADIENT ENCODING 


Conventional encoding ignores the a priori information about the 
location of the next extreme point and can make large errors in achieving 
the best performance from a decoder. Gradient encoding ignores the 


* In this context relaxation implies a systematic iterative selection. 
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basic premise of conventional encoding by forcing the next bit to be of 
opposite polarity if the present decoder voltage has exceeded the input 
signal, and retains a slight error at the present position in an overall at- 
tempt to do its best to reach the target extremity of the wave shape. 
When targets become very far apart, then the intermediate ranges of fit 
start to shrink and minimize the error at the intermediate points. In an 
extreme case of absolute silence, gradient encoding and conventional 
encoding converge in the bit selection of alternate zeros and ones. 

The anticipatory characteristics of gradient encoding also prepare the 
decoder for sudden peaks in wave shapes by sending a series of identical 
bits before arriving at the peak, so that the extreme point is within a 
predetermined range of error. This is totally absent in conventional 
encoding. 


IV. PERFORMANCE OF GRADIENT ENCODING—ALGEBRAIC WAVES 
4.1 Summed sine waves 


Figure la indicates the performance of the conventional ADM en- 
coding technique when a sampling frequency of 12 kHz has been used 
to excite the encoder which is following an input wave generated as the 
sum of two sine waves, one at 400 Hz with an amplitude of 80 mV, and 
the other at 1200 Hz with an amplitude of 100 mV. In contrast, Fig. 1b 
indicates the performance of the gradient encoder technique under the 
same conditions. In Fig. 1a it can be seen that the point 3 on the dotted 
line being very close to 3 on the full line can materially change the next 
bit polarity and thus change the ensuing bit pattern; whereas in Fig. 1b 
the gradient encoding tolerates errors at 3, 4, 5 and 6 in order to match 
the segment 2-6 on the decoder curve (dotted line) as closely with the 
section 2-6 of the input, (solid line) and concentrates the transition of 
010 near the peak where it should logically be placed. Other such vari- 
ations are also noticeable by comparing la and Lb. 


4.2 Interrupted sine wave 

The anticipatory character of gradient encoding is evident by com- 
paring Figs. 2a and 2b. The input to the two types of encoders is a sine 
wave at 1200 Hz at 100 mV interrupted at a frequency of 400 Hz. In Fig 
2a it can be noted that the encoder starts to respond by a series of fixed 
values only after the input wave has actually been presented at the en- 
coder, whereas the gradient encoder in Fig 2b starts to process the bits 
prior to actual impact of the wave, and adjust its bits accordingly. 


V. SIGNAL-TO-NOISE RATIO ANALYSIS 
5.1 Program description 
The performances of conventional and gradient encoders have been 


modeled on a Nova 800 minicomputer by a sequence of machine language 
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Fig. 1—(a) Conventional encoding. 


and Fortran programs. The numerical computations are confined toa 
block of replaceable Fortran programs and the data handling from disc 
is performed by a set of machine language subroutines. Both commun- 
icate with the operator in a conversational mode and the circuit pa- 
rameters are input controlled. It is thus possible tocompare the relative 
performance of the two types of coding schemes under any set of con- 
ditions. The results are presented in the following sections. 
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Fig. 1—(b) Gradient encoding. 


5.2 Cross comparison of performance at the same clock rate 


Four clock rates (32, 24, 16, and 12 kHz) are chosen to compare the 
performance with the ADM codec described in Ref. 1. The charging time 
constants of the step size capacitor and the levels have been optimized 
to achieve the best S/N ratios with different number of bits for com- 
panding (see Section II, Ref. 1). However, in the case of the 4 bit com- 
panding the charging time constant has been retained as 3 msecs as it 
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Fig. 2—(a) Conventional encoder. 


currently exists. Table I contains the computed S/N ratios for 32 and 
24 kbit per second clock rates and Table II contains the result from 16 
and 12 kbits per second simulations. 


5.3 Half-rate gradient encoding performance 


Tables III and IV compare the performances of the 16 and 12 kbit per 
second gradient encoding against 32 and 24 kbits per second conventional 
encoding respectively. The time constants and levels have again been 
optimized to yield the best performance from the codec. 


784 THE BELL SYSTEM TECHNICAL JOURNAL, MARCH 1978 


I 
WIN EV Nay 
ny2 Esvaleret Nel ayo fh i243 141 
yi lu \ro 
HEPES 
\ 
411 
\ 


—e— INPUT TO ENCODER 
—t— DECODER OUTPUT 





Fig. 2—(b) Gradient encoder. 


VI. PERFORMANCE WITH SPEECH 


Gradient encoding outperforms conventional encoding at the same 
bit rate. As the bit rate is reduced for gradient encoding a region of in- 
difference is encountered between 50 to 60 percent of the rate of con- 
ventional encoding. The attack time constant (i.e., the product of the 
resistance for charging the step size capacitor and its value) starts to 
influence the higher frequency response but enhances the signal to noise 
ratio at the lower end of the audio frequency response and vice versa. 
A compromise is necessary to achieve the best response over the range 
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Table | — Computed S/N ratios at 32 and 24 kHz 


Clock: 32 KHz 24 KHz 
Coding: Conventional Gradient Conventional Gradient 
Audio 
frequency, No. 
Hz of S/N, S/N, S/N, S/N, 
(sine waves) bits dB dB dB dB 
5700 3 —* 10.2 — 
4 — 0.8 — 
4700 3 — 12.8 — 
4 — 13.5 — 
3950 3 9.0 8.0 13.8 
4 7.2 10.3 — 2.5 
3150 3 12.0 14.7 — 12.3 
4 1.4 13.3 — 12.4 
2900 3 9.7 15.8 7.9 10.0 
4 6.4 15.6 5.7 9.2 
2490 3 11.9 20.3 3.7 16.6 
4 12.3 15.6 2.1 12.0 
1660 3 19.6 24.5 16.0 19.3 
4 16.0 21.4 13.0 17.0 
830 3 29.7 35.2 25.1 30.0 
4 26.0 32.3 23.2 26.6 
415 3 38.5 40.7 31.6 33.0 
4 34.0 39.1 31.0 34. 


* __ indicates near-zero values 


of importance for telephone conversation. However, since the quanti- 
zation noise in gradient encoding is scarcely present due to the optimi- 
zation of the selected bit pattern, the region of indifference between 
gradient encoding and conventional encoding tends to be biased in favor 
of the former at a slight expense of the higher audio frequency response. 
Informal subjective testing has indicated that the 12 kbit per sec, 2 bit 
companded, gradient-encoded speech is comparable with the 24 kbit per 
sec, 4 bit companded, conventionally encoded speech.* However, the 
24 kbit per sec sequentially companded speech! shows a favorable margin 
of performance over the 12 kbit per sec gradient-encoded speech. 

The computation time depends on the bit rate and concentration of 
peaks and valleys. Lower frequency wave forms demand more compu- 
tations in order to perform intermediate compute-and-match attempts. 
Higher frequencies on the other hand are adequately fitted by fewer 
overall gradient matching trials. When long telephone announcements 


* This improvement has been made possible because gradient encoding does not attempt 
to maximize the signal to noise ratio but instead, matches the extremities of the wave shape. 
When the incoming wave shape offers a large cyclic change in the transition (due to change 
in pitch) of the gradient between and peak-valley or valley-peak pair together with a steep 
gradient between the points, gradient encoding ignores the cyclic variation in the gradient 
whereas an attempt to maximize the S/N (as it is done in tree encoding) tries to accom- 
modate the cyclic change and can lead to a perceptually poorer quality of speech. To this 
extent gradient encoding outperforms tree encoding. 
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Table Il — Computed S/N ratios at 16 kbits and 12 kbits/sec 


‘Bit rate: 16 kbits/sec 12 kbits/sec 
Coding: Conventional Gradient Conventional Gradient 
Audio Companding 
frequency, No. of S/N, S/N, S/N, S/N, 
Hz bits dB dB dB dB 
3150 2 —* 7.2 a 3.4 
3 — 2.8 — — 
2900 2 — 9.2 = 5.3 
3 — 4.0 — 0.3 
2490 2 — 13.9 — 5.9 
3 — 7.3 — 1.0 
1660 2 2.1 18.0 1.6 11.7 
3 0.92 19.0 — 7.5 
830 2 19.5 28.5 18.4 26.0 
3 21.2 28.4 17.8 27.8 
415 2 30.4 30.6 19.9 30.0 
3 32.2 30.2 27.5 29.3 


*__ indicates near-zero values 


Table IIl — Comparison of 12 kbits/sec gradient and 24 kbits/sec 
conventional coding 
Bit rate: 12 kbits/sec 24 kbits/sec 
Coding: Gradient Conventional 
Audio Companding Companding 
frequency, No. of S/N, No. of S/N, 
Hz bits dBi bits* dB 
3150 2 3.4 3 = 
3 eee 4 = 
2900 2 5.3 3 7.9 
3 .32 4 5.7 
2490 2 5.9 3 3.7 
3 1.0 4 2.1 
1660 2 11.7 3 16.0 
3 7.5 4 13.0 
830 2 26.0 3 25.1 
3 27.6 4 23.2 
415 2 30.0 3 31.6 
3 29.3 4 31.0 


* 92-bit companding at 24 kbits/sec leads to extremely noisy silence periods. 

t Changing the time constants of the attack circuit (see Section II, Ref. 1) changes the 
distribution of S/N ratios between the low and high audio frequencies. For instance with 
a 66 percent time constant the values of the S/N ratios are 4.8, 9.4, 9.6, 13.4, 21.6, 23.5 dB 
from 3150 to 415 Hz respectively with 2-bit companding. 

t __ indicates near-zero values 


are synthesized we have noticed a one-third second (corresponding to 
256 sixteen-bit words at 12 kbaud) of speech occasionally demanding 
as long as 20 minutes of Nova-800 minicomputer CPU time. This par- 
ticular machine has an 800-nsec cycle time and hardware floating point 
multiply-divide facility. Conversely other one-third second speech 
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Table IV — Comparison of 16 kbits/sec gradient and 32 kbits/sec 
conventional coding 


Bit rate: 16 kbits/sec 32 kbits/sec 
Coding: Gradient Conventional 
Audio Companding Companding 
frequency, No. of S/N, No. of S/N, 
Hz bits db bits dB 
3950 2 6.1 3 9.0 
3 1.0 4 7.2 
3150 2 4.7 3 12.0 
3 2.8 4 1.4 
2900 2 6.5 3 9.7 
3 4.0 4 6.4 
2490 2 8.5 3 11.9 
3 7.3 4 12.3 
1660 2 15.3 3 19.6 
3 19.0 4 16.0 
830 2 27.7 3 29.7 
3 28.4 4 26.0 
415 2 36.8 3 38.5 
3 30.2 4 34.0 


segments are synthesized in as little as four minutes. Averaged over three 
and a half minutes of speech synthesis, the computational time is roughly 
half an hour per second of real time speech signifying two thirds billion 
arithmetic operations? for every second of message. Stated alternatively 
one may expect one third million numerical functions between a typical 
peak and valley of the speech waveshape. 

The computations during the silence periods are not trivial since 
gradient encoding is always alert to the incidence of the next peak (or 
valley). During the interval the dispersion of zeros and ones alternately 
is limited to that period which is too long to prepare the decoder for the 
next peak (or valley). 


Vil. REAL TIME IMPLEMENTATION 


The real time implementation of gradient encoding is feasible in two 
distinct ways: (i) by a multiplicity of decoder circuits with feedback 
paths, each one being excited by a bit pattern of zeros or ones over a finite 
intervals and then selecting the pattern of the decoder which yields the 
waveform closest* to the waveform of the original speech or (ii) by one 
decoder circuit whose internal timing has been hastened dramatically 
by decreasing all the time constants in the circuit accordingly, and then 


t This includes the modeling of all nonlinearities as they exist in the codec, the algebraic 
representation of most circuit elements, all the address computations, changing the syn- 
chronization rates between the scanning A/D converter and the codec clock rate, etc. 

* Such as to maximize the S/N. 
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choosing that bit pattern which yields the waveform closest to that of 
the incoming speech. The former technique proposes that the data from 
the encoder transmitted to the final real time decoder is selected as the 
data of that particular real time decoder whose output came closest to 
that of the incoming speech. The latter technique proposes that the data 
of the encoder transmitted to the final real time decoder is selected as 
that binary combination of bits which had brought the waveform of the 
faster non-real-time decoder closest to that of the original speech. 

Both of these techniques explore every branch of tree encoding to 
determine which one of the binary sequences yields the best perfor- 
mance. Our preliminary estimations show that eight decoder circuit for 
implementing technique (z) and an accelerated clock rate at about 100 
kHz for a 12 kbaud data rate would be a reasonable compromise between 
complexity of the encoder design and optimality of bit configuration 
from the encoder design and optimality of bit configuration from the 
encoder. Such an arrangement is expected to enhance overall signal to 
noise ratio by 2 to 4 dB during the transmission of speech and the ac- 
celerated decoder circuits are completely capable of responding at about 
100 kHz. Further tree encoding with 3-bit look-ahead option achieves 
most of the advantages obtained by these encoding schemes. 


Vill. CONCLUSION 


The success of the gradient encoding lies in the complete knowledge 
of the incoming speech waveshape. This prior information has been 
employed to optimize the bit pattern and thus reduce the bit rate. At- 
tempts to reduce the storage requirement by one-half appear to have 
achieved an encouraging degree of success. The combined effects of se- 
quential companding and gradient encoding are being investigated for 
bit rate reductions ranging between 60 and 66 percent for the same 
quality of speech. 
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